Byte Pair Encoding

Introduced by Sennrich et al. in Neural Machine Translation of Rare Words with Subword Units

Byte Pair Encoding, or BPE, is a subword segmentation algorithm that encodes rare and unknown words as sequences of subword units. The intuition is that various word classes are translatable via smaller units than words, for instance names (via character copying or transliteration), compounds (via compositional translation), and cognates and loanwords (via phonological and morphological transformations).

Lei Mao has a detailed blog post that explains how this works.

Source: Neural Machine Translation of Rare Words with Subword Units

Latest Papers

PAPER DATE
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Xie ChenYu WuZhenghao WangShujie LiuJinyu Li
2020-10-22
Scientific Claim Verification with VERT5ERINI
Ronak PradeepXueguang MaRodrigo NogueiraJimmy Lin
2020-10-22
mT5: A massively multilingual pre-trained text-to-text transformer
| Linting XueNoah ConstantAdam RobertsMihir KaleRami Al-RfouAditya SiddhantAditya BaruaColin Raffel
2020-10-22
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
| Alexey DosovitskiyLucas BeyerAlexander KolesnikovDirk WeissenbornXiaohua ZhaiThomas UnterthinerMostafa DehghaniMatthias MindererGeorg HeigoldSylvain GellyJakob UszkoreitNeil Houlsby
2020-10-22
Multi-Unit Transformer for Neural Machine Translation
| Jianhao YanFandong MengJie zhou
2020-10-21
TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog
Wubo LiDongwei JiangWei ZouXiangang Li
2020-10-21
Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation
| Elena VoitaRico SennrichIvan Titov
2020-10-21
Token Drop mechanism for Neural Machine Translation
| Huaao ZhangShigui QiuXiangyu DuanMin Zhang
2020-10-21
WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information
| An TranKonstantinos DrossosTuomas Virtanen
2020-10-21
Multi-Domain Dialogue State Tracking based on State Graph
Yan ZengJian-Yun Nie
2020-10-21
AutoMeTS: The Autocomplete for Medical Text Simplification
Hoang VanDavid KauchakGondy Leroy
2020-10-20
Transition-based Parsing with Stack-Transformers
Ramon Fernandez AstudilloMiguel BallesterosTahira NaseemAustin BlodgettRadu Florian
2020-10-20
PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval
| Xinyu MaJiafeng GuoRuqing ZhangYixing FanXiang JiXueqi Cheng
2020-10-20
Topic-Aware Abstractive Text Summarization
| Chujie ZhengKunpeng ZhangHarry Jiannan WangLing Fan
2020-10-20
Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation
Laurel OrrMegan LeszczynskiSimran AroraSen WuNeel GuhaXiao LingChristopher Re
2020-10-20
BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search
Yunjiang JiangYue ShangZiyang LiuHongwei ShenYun XiaoWei XiongSulong XuWeipeng YanDi Jin
2020-10-20
Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters
Shaohuai ShiXianhao ZhouShutao SongXingyao WangZilin ZhuXue HuangXinan JiangFeihu ZhouZhenyu GuoLiqiang XieRui LanXianbin OuyangYan ZhangJieqian WeiJing GongWeiliang LinPing GaoPeng MengXiaomin XuChenyang GuoBo YangZhibo ChenYongjian WuXiaowen Chu
2020-10-20
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
| Hicham El BoukkouriOlivier FerretThomas LavergneHiroshi NojiPierre ZweigenbaumJunichi Tsujii
2020-10-20
Infusing Sequential Information into Conditional Masked Translation Model with Self-Review Mechanism
Pan XieZhi CuiXiuyin ChenXiaohui HuJianwei CuiBin Wang
2020-10-19
Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization
Tiezheng YuDan SuWenliang DaiPascale Fung
2020-10-19
Query-aware Tip Generation for Vertical Search
Yang YangJunmei HaoCanjia LiZili WangJingang WangFuzheng ZhangRao FuPeixu HouGong ZhangZhongyuan Wang
2020-10-19
Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering
Jeroen OfferijnsSuzan VerberneTessa Verhoef
2020-10-19
Parameter Norm Growth During Training of Transformers
William MerrillVivek RamanujanYoav GoldbergRoy SchwartzNoah Smith
2020-10-19
Capturing Longer Context for Document-level Neural Machine Translation: A Multi-resolutional Approach
| Zewei SunMingxuan WangHao ZhouChengqi ZhaoShuJian HuangJiajun ChenLei LI
2020-10-18
Cross-Lingual Relation Extraction with Transformers
Jian NiTaesun MoonParul AwasthyRadu Florian
2020-10-16
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
| Shengkui ZhaoTrung Hieu NguyenHao WangBin Ma
2020-10-16
DiDi's Machine Translation System for WMT2020
Tanfang ChenWeiwei WangWenyang WeiXing ShiXiangang LiJieping YeKevin Knight
2020-10-16
Modeling Token-level Uncertainty to Learn Unknown Concepts in SLU via Calibrated Dirichlet Prior RNN
Yilin ShenWenhu ChenHongxia Jin
2020-10-16
Revisiting Optical Flow Estimation in 360 Videos
Keshav BhandariZiliang ZongYan Yan
2020-10-15
Empirical Study of Transformers for Source Code
Nadezhda ChirkovaSergey Troshin
2020-10-15
Multi-Task Learning for Cross-Lingual Abstractive Summarization
Sho TakaseNaoaki Okazaki
2020-10-15
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis
Zhengxuan WuDesmond C. Ong
2020-10-15
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
| Ana MarasovićChandra BhagavatulaJae Sung ParkRonan Le BrasNoah A. SmithYejin Choi
2020-10-15
DialogueTRM: Exploring the Intra- and Inter-Modal Emotional Behaviors in the Conversation
Yuzhao MaoQi SunGuang LiuXiaojie WangWeiguo GaoXuan LiJianping Shen
2020-10-15
[email protected]: Sentiment Analysis of Code-Mixed Dravidian text using XLNet
Shubhanker BanerjeeArun JayapalSajeetha Thavareesan
2020-10-15
Compressive Summarization with Plausibility and Salience Modeling
| Shrey DesaiJiacheng XuGreg Durrett
2020-10-15
Masked Contrastive Representation Learning for Reinforcement Learning
| Jinhua ZhuYingce XiaLijun WuJiajun DengWengang ZhouTao QinHouqiang Li
2020-10-15
Understanding Neural Abstractive Summarization Models via Uncertainty
| Jiacheng XuShrey DesaiGreg Durrett
2020-10-15
Memformer: The Memory-Augmented Transformer
Qingyang WuZhenzhong LanJing GuZhou Yu
2020-10-14
DA-Transformer: Distance-aware Transformer
Chuhan WuFangzhao WuYongfeng Huang
2020-10-14
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
| Gyuwan KimKyunghyun Cho
2020-10-14
Decoding Methods for Neural Narrative Generation
| Alexandra DeLuciaAaron MuellerXiang Lisa LiJoão Sedoc
2020-10-14
Probing for Multilingual Numerical Understanding in Transformer-Based Language Models
| Devin JohnsonDenise MakDrew BarkerLexi Loessberg-Zahl
2020-10-13
The workweek is the best time to start a family -- A Study of GPT-2 Based Claim Generation
Shai GretzYonatan BiluEdo Cohen-KarlikNoam Slonim
2020-10-13
Context-Aware Drive-thru Recommendation Service at Fast Food Restaurants
Luyang WangKai HuangJiao WangShengsheng HuangJason DaiYue Zhuang
2020-10-13
Aspect-based Document Similarity for Research Papers
| Malte OstendorffTerry RuasTill BlumeBela GippGeorg Rehm
2020-10-13
Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension
Ekta SoodSimon TannertDiego FrassinelliAndreas BullingNgoc Thang Vu
2020-10-13
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D. HwangChandra BhagavatulaRonan Le BrasJeff DaKeisuke SakaguchiAntoine BosselutYejin Choi
2020-10-12
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. BirdAnikó EkártDiego R. Faria
2020-10-12
Meta-Context Transformers for Domain-Specific Response Generation
Debanjana KarSuranjana SamantaAmar Prakash Azad
2020-10-12
Probing Pretrained Language Models for Lexical Semantics
Ivan VulićEdoardo Maria PontiRobert LitschkoGoran GlavašAnna Korhonen
2020-10-12
Dynamic Memory Enhanced Transformer for End-to-end Task-Oriented Dialogue System
Yanjie GouYinjie LeiLingqiao Liu
2020-10-12
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task
Zuchao LiHai ZhaoRui WangKehai ChenMasao UtiyamaEiichiro Sumita
2020-10-11
Machine Translation of Mathematical Text
Aditya OhriTanya Schmah
2020-10-11
Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU
| Brielen MadureiraDavid Schlangen
2020-10-11
Information Extraction from Swedish Medical Prescriptions with Sig-Transformer Encoder
John Pougue BiyongBo wangTerry LyonsAlejo J Nevado-Holgado
2020-10-10
Structured Self-Attention Weights Encode Semantics in Sentiment Analysis
| Zhengxuan WuThanh-Son NguyenDesmond C. Ong
2020-10-10
Automated Concatenation of Embeddings for Structured Prediction
Xinyu WangYong JiangNguyen BachTao WangZhongqiang HuangFei HuangKewei Tu
2020-10-10
On Task-Level Dialogue Composition of Generative Transformer Model
| Prasanna ParthasarathiArvind NeelakantanSharan Narang
2020-10-09
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin HuangPatrick Lumban TobingYi-Chiao WuKazuhiro KobayashiTomoki Toda
2020-10-09
Online Back-Parsing for AMR-to-Text Generation
Xuefeng BaiLinfeng SongYue Zhang
2020-10-09
What Have We Achieved on Text Summarization?
Dandan HuangLeyang CuiSen yangGuangsheng BaoKun WangJun XieYue Zhang
2020-10-09
Shallow-to-Deep Training for Neural Machine Translation
Bei LiZiyang WangHui LiuYufan JiangQuan DuTong XiaoHuizhen WangJingbo Zhu
2020-10-08
Improving Attention Mechanism with Query-Value Interaction
Chuhan WuFangzhao WuTao QiYongfeng Huang
2020-10-08
TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Parker RileyNoah ConstantMandy GuoGirish KumarDavid UthusZarana Parekh
2020-10-08
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection
| Libo QinTailu LiuWanxiang CheBingbing KangSendong ZhaoTing Liu
2020-10-08
Interlocking Backpropagation: Improving depthwise model-parallelism
Aidan N. GomezOscar KeyStephen GouNick FrosstJeff DeanYarin Gal
2020-10-08
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou ZhuWeijie SuLewei LuBin LiXiaogang WangJifeng Dai
2020-10-08
Optimizing Transformers with Approximate Computing for Faster, Smaller and more Accurate NLP Models
Amrit NagarajanSanchari SenJacob R. StevensAnand Raghunathan
2020-10-07
Transformer-GCRF: Recovering Chinese Dropped Pronouns with General Conditional Random Fields
Jingxuan YangKerui XuJun XuSi LiSheng GaoJun GuoJi-Rong WenNianwen Xue
2020-10-07
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
Thai-Son NguyenSebastian StuekerAlex Waibel
2020-10-07
Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing
Xilun ChenAsish GhoshalYashar MehdadLuke ZettlemoyerSonal Gupta
2020-10-07
Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications
Matthew KhouryRumen DangovskiLongwu OuPreslav NakovYichen ShenLi Jing
2020-10-06
Adversarial Grammatical Error Correction
Vipul RahejaDimitrios Alikaniotis
2020-10-06
Investigating African-American Vernacular English in Transformer-Based Text Generation
Sophie GroenwoldLily OuAesha ParekhSamhita HonnavalliSharon LevyDiba MirzaWilliam Yang Wang
2020-10-06
An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks
| Kyubyong ParkJoohong LeeSeongbo JangDawoon Jung
2020-10-06
Converting the Point of View of Messages Spoken to Virtual Assistants
| Isabelle G. LeeVera ZuSai Srujana BuddiDennis LiangJack G. M. FitzGerald
2020-10-06
On the Sub-Layer Functionalities of Transformer Decoder
Yilin YangLongyue WangShuming ShiPrasad TadepalliStefan LeeZhaopeng Tu
2020-10-06
Incorporating Behavioral Hypotheses for Query Generation
Ruey-Cheng ChenChia-Jung Lee
2020-10-06
Analyzing Individual Neurons in Pre-trained Language Models
Nadir DurraniHassan SajjadFahim DalviYonatan Belinkov
2020-10-06
Resource-Enhanced Neural Model for Event Argument Extraction
Jie MaShuai WangRishita AnubhaiMiguel BallesterosYaser Al-Onaizan
2020-10-06
Beyond [CLS] through Ranking by Generation
Cicero Nogueira dos santosXiaofei MaRamesh NallapatiZhiheng HuangBing Xiang
2020-10-06
Efficient Inference For Neural Machine Translation
Yi-Te HsuSarthak GargYi-Hsiu LiaoIlya Chatsviorkin
2020-10-06
PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation
Xinyu HuaLu Wang
2020-10-05
Transformer-Based Neural Text Generation with Syntactic Guidance
Yinghao LiRui FengIsaac RehgChao Zhang
2020-10-05
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?
Shayne LongpreYu WangChristopher DuBois
2020-10-05
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior
Zi LinJeremiah Zhe LiuZi YangNan HuaDan Roth
2020-10-05
GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. FengVarun GangalDongyeop KangTeruko MitamuraEduard Hovy
2020-10-05
PUM at SemEval-2020 Task 12: Aggregation of Transformer-based models' features for offensive language recognition
Piotr JaniszewskiMateusz SkibaUrszula Walińska
2020-10-05
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
Dayiheng LiuYeyun GongJie FuYu YanJiusheng ChenJiancheng LvNan DuanMing Zhou
2020-10-04
Inquisitive Question Generation for High Level Text Comprehension
Wei-Jen KoTe-Yuan ChenYiyan HuangGreg DurrettJunyi Jessy Li
2020-10-04
STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++
Jack G. M. FitzGerald
2020-10-02
Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis
katsuhiko IshiguroKazuya UjiharaRyohto SawadaHirotaka AkitaMasaaki Kotera
2020-10-02
Beyond Chemical 1D knowledge using Transformers
Ruud Van DeursenIgor V. TetkoGuillaume Godin
2020-10-02
Evaluating Multilingual BERT for Estonian
Claudia KittaskKirill MilintsevichKairit Sirts
2020-10-01
Phonemer at WNUT-2020 Task 2: Sequence Classification Using COVID Twitter BERT and Bagging Ensemble Technique based on Plurality Voting
| Anshul Wadhawan
2020-10-01
A Compare Aggregate Transformer for Understanding Document-grounded Dialogue
Longxuan MaWei-Nan ZhangRunxin SunTing Liu
2020-10-01
Examining the rhetorical capacities of neural language models
Zining ZhuChuer PanMohamed AbdallaFrank Rudzicz
2020-10-01
CoLAKE: Contextualized Language and Knowledge Embedding
| Tianxiang SunYunfan ShaoXipeng QiuQipeng GuoYaru HuXuanjing HuangZheng Zhang
2020-10-01
WeChat Neural Machine Translation Systems for WMT20
Fandong MengJianhao YanYijin LiuYuan GaoXianfeng ZengQinsong ZengPeng LiMing ChenJie zhouSifan LiuHao Zhou
2020-10-01
Learning Hard Retrieval Cross Attention for Transformer
Hongfei XuQiuhui Liu
2020-09-30
Measuring Systematic Generalization in Neural Proof Generation with Transformers
Nicolas GontierKoustuv SinhaSiva ReddyChristopher Pal
2020-09-30
Rethinking Attention with Performers
| Krzysztof ChoromanskiValerii LikhosherstovDavid DohanXingyou SongAndreea GaneTamas SarlosPeter HawkinsJared DavisAfroz MohiuddinLukasz KaiserDavid BelangerLucy ColwellAdrian Weller
2020-09-30
MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Carson EisenachYagna PatelDhruv Madeka
2020-09-30
Gender prediction using limited Twitter Data
Maaike BurghoornMaaike H. T. de BoerStephan Raaijmakers
2020-09-29
Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level Instructions
| Peter A. Jansen
2020-09-29
Attention that does not Explain Away
Nan DingXinjie FanZhenzhong LanDale SchuurmansRadu Soricut
2020-09-29
The design and implementation of Language Learning Chatbot with XAI using Ontology and Transfer Learning
Nuobei ShiQin ZengRaymond Lee
2020-09-29
A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation
Dinghan ShenMingzhi ZhengYelong ShenYanru QuWeizhu Chen
2020-09-29
Sequence-to-Sequence Learning for Indonesian Automatic Question Generator
Ferdiant Joshua MuisAyu Purwarianti
2020-09-29
VIVO: Surpassing Human Performance in Novel Object Captioning with Visual Vocabulary Pre-Training
Xiaowei HuXi YinKevin LinLijuan WangLei ZhangJianfeng GaoZicheng Liu
2020-09-28
Accelerating Multi-Model Inference by Merging DNNs of Different Weights
Joo Seong JeongSoojeong KimGyeong-In YuYunseong LeeByung-Gon Chun
2020-09-28
Deep Transformers with Latent Depth
Xian LiAsa Cooper SticklandYuqing TangXiang Kong
2020-09-28
What does it mean to be language-agnostic? Probing multilingual sentence encoders for typological properties
Rochelle ChoenniEkaterina Shutova
2020-09-27
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
| Ye LiuYao WanLifang HeHao PengPhilip S. Yu
2020-09-26
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context
Jean-Philippe CorbeilHadi Abdi Ghadivel
2020-09-25
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Zhaojiang LinAndrea MadottoGenta Indra WinataPascale Fung
2020-09-25
Weird AI Yankovic: Generating Parody Lyrics
Mark Riedl
2020-09-25
A little goes a long way: Improving toxic language classification despite data scarcity
Mika JuutiTommi GröndahlAdrian FlanaganN. Asokan
2020-09-25
Toward a Thermodynamics of Meaning
Jonathan Scott Enderle
2020-09-24
Multi-Pass Transformer for Machine Translation
Peng GaoChiori HoriShijie GengTakaaki HoriJonathan Le Roux
2020-09-23
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition
Bingcong LiXin TangXianbiao QiYihao ChenRong Xiao
2020-09-23
Robustification of Segmentation Models Against Adversarial Perturbations In Medical Imaging
Hanwool ParkAmirhossein BayatMohammad SabokrouJan S. KirschkeBjoern H. Menze
2020-09-23
On Data Augmentation for Extreme Multi-label Classification
Danqing ZhangTao LiHaiyang ZhangBing Yin
2020-09-22
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19 Event Extraction on Social Media
Congcong WangDavid Lillis
2020-09-21
Alleviating the Inequality of Attention Heads for Neural Machine Translation
Zewei SunShujian HuangXinyu DaiJiajun Chen
2020-09-21
Empathetic Dialogue Generation via Knowledge Enhancing and Emotion Dependency Modeling
Qintong LiPiji LiZhumin ChenZhaochun Ren
2020-09-21
Prior Art Search and Reranking for Generated Patent Text
Jieh-Sheng LeeJieh Hsiang
2020-09-19
Towards Computational Linguistics in Minangkabau Language: Studies on Sentiment Analysis and Machine Translation
Fajri KotoIkhwan Koto
2020-09-19
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models
Jihyeon RohHuiseong GimSoo-Young Lee
2020-09-18
Towards Fully 8-bit Integer Inference for the Transformer Model
Ye LinYanyang LiTengbo LiuTong XiaoTongran LiuJingbo Zhu
2020-09-17
Multi^2OIE: Multilingual Open Information Extraction based on Multi-Head Attention with BERT
Youngbin RoYukyung LeePilsung Kang
2020-09-17
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya GuoShuo RenShuai LuZhangyin FengDuyu TangShujie LiuLong ZhouNan DuanJian YinDaxin JiangMing Zhou
2020-09-17
Document-level Neural Machine Translation with Document Embeddings
Shu JiangHai ZhaoZuchao LiBao-Liang Lu
2020-09-16
Retrofitting Structure-aware Transformer Language Model for End Tasks
Hao FeiYafeng RenDonghong Ji
2020-09-16
Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation
Insoo ChungByeongwook KimYoonjung ChoiSe Jung KwonYongkweon JeonBaeseong ParkSangha KimDongsoo Lee
2020-09-16
Graph-to-Sequence Neural Machine Translation
Sufeng DuanHai ZhaoRui Wang
2020-09-16
NABU -- Multilingual Graph-based Neural RDF Verbalizer
Diego MoussallemDwaraknath GnaneshwarThiago Castro FerreiraAxel-Cyrille Ngonga Ngomo
2020-09-16
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
Juan Cruz-BenitoSanjay VishwakarmaFrancisco Martin-FernandezIsmael Faro
2020-09-16
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
Kris McGuffieAlex Newhouse
2020-09-15
Dialogue Response Ranking Training with Large-Scale Human Feedback Data
| Xiang GaoYizhe ZhangMichel GalleyChris BrockettBill Dolan
2020-09-15
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
| Timo SchickHinrich Schütze
2020-09-15
Critical Thinking for Language Models
Gregor Betz
2020-09-15
Attention-Aware Inference for Neural Abstractive Summarization
Ye MaLu Zong
2020-09-15
Event Presence Prediction Helps Trigger Detection Across Languages
Parul AwasthyTahira NaseemJian NiTaesun MoonRadu Florian
2020-09-15
Efficient Transformers: A Survey
Yi TayMostafa DehghaniDara BahriDonald Metzler
2020-09-14
GeDi: Generative Discriminator Guided Sequence Generation
Ben KrauseAkhilesh Deepak GotmareBryan McCannNitish Shirish KeskarShafiq JotyRichard SocherNazneen Fatema Rajani
2020-09-14
BoostingBERT:Integrating Multi-Class Boosting into BERT for NLP Tasks
Tongwen HuangQingyun SheJunlin Zhang
2020-09-13
Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication
Haihua ChenHuyen Nguyen
2020-09-12
Unit Test Case Generation with Transformers
Michele TufanoDawn DrainAlexey SvyatkovskiyShao Kun DengNeel Sundaresan
2020-09-11
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
Murad TukanAlaa MaaloufMatan WekslerDan Feldman
2020-09-11
UPB at SemEval-2020 Task 6: Pretrained Language Models for DefinitionExtraction
Andrei-Marius AvramDumitru-Clementin CercelCostin-Gabriel Chiru
2020-09-11
GTEA: Representation Learning for Temporal Interaction Graphs via Edge Aggregation
Yiming LiDa Sun Handason TamSiyue XieXiaxin LiuQiu Fang YingWing Cheong LauDah Ming ChiuShou Zhi Chen
2020-09-11
FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding
Yuwei FangShuohang WangZhe GanSiqi SunJingjing Liu
2020-09-10
Rank over Class: The Untapped Potential of Ranking in Natural Language Processing
Amir Atapour-AbarghoueiStephen BonnerAndrew Stephen McGough
2020-09-10
Sparsifying Transformer Models with Differentiable Representation Pooling
Michał PietruszkaŁukasz BorchmannFilip Graliński
2020-09-10
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas AffolterBeni EgressyDamian PascualRoger Wattenhofer
2020-09-10
Modern Methods for Text Generation
| Dimas Munoz Montesinos
2020-09-10
Learning Universal Representations from Word to Sentence
Yian LiHai Zhao
2020-09-10
Pay Attention when Required
Swetha MandavaSzymon MigaczAlex Fit Florea
2020-09-09
Central Yup'ik and Machine Translation of Low-Resource Polysynthetic Languages
Christopher LiuLaura DominéKevin ChavezRichard Socher
2020-09-09
Masked Label Prediction: Unified Massage Passing Model for Semi-Supervised Classification
Yunsheng ShiZhengjie HuangWenjin WangHui ZhongShikun FengYu Sun
2020-09-08
Improving Language Generation with Sentence Coherence Objective
Ruixiao SunJie YangMehrdad Yousefzadeh
2020-09-07
Robust Conversational AI with Grounded Text Generation
Jianfeng GaoBaolin PengChunyuan LiJinchao LiShahin ShayandehLars LidenHeung-Yeung Shum
2020-09-07
Black Box to White Box: Discover Model Characteristics Based on Strategic Probing
Josh KalinMatthew CiolinoDavid NoeverGerry Dozier
2020-09-07
Measuring Massive Multitask Language Understanding
Dan HendrycksCollin BurnsSteven BasartAndy ZouMantas MazeikaDawn SongJacob Steinhardt
2020-09-07
TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment Analysis
Zilong WangZhaohong WanXiaojun Wan
2020-09-07
Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
Sahar AbdelnabiMario Fritz
2020-09-07
EdinburghNLP at WNUT-2020 Task 2: Leveraging Transformers with Generalized Augmentation for Identifying Informativeness in COVID-19 Tweets
Nickil Maveli
2020-09-06
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
2020-09-06
Comparative Evaluation of Pretrained Transfer Learning Models on Automatic Short Answer Grading
Sasi Kiran GaddipatiDeebul NairPaul G. Plöger
2020-09-02
LiftFormer: 3D Human Pose Estimation using attention models
Adrian Llopart
2020-09-01
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Wei LiJames QinChung-Cheng ChiuRuoming PangYanzhang He
2020-08-30
HittER: Hierarchical Transformers for Knowledge Graph Embeddings
Sanxing ChenXiaodong LiuJianfeng GaoJian JiaoRuofei ZhangYangfeng Ji
2020-08-28
TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for Identification of Informative COVID-19 English Tweets
Anh Tuan Nguyen
2020-08-28
Knowledge Efficient Deep Learning for Natural Language Processing
Hai Wang
2020-08-28
DAVE: Deriving Automatically Verilog from English
Hammond PearceBenjamin TanRamesh Karri
2020-08-27
Improvement of a dedicated model for open domain persona-aware dialogue generation
Qiang Han
2020-08-27
Discrete Word Embedding for Logical Natural Language Understanding
Masataro AsaiZilu Tang
2020-08-26
A Multitask Deep Learning Approach for User Depression Detection on Sina Weibo
Yiding WangZhenyi WangChenghao LiYilin ZhangHaizhou Wang
2020-08-26
End to End Dialogue Transformer
Ondřej MěkotaMemduh GökırmakPetr Laitoch
2020-08-24
Identity-Aware Multi-Sentence Video Description
| Jae Sung ParkTrevor DarrellAnna Rohrbach
2020-08-22
PTT5: Pretraining and validating the T5 model on Brazilian Portuguese data
| Diedre CarmoMarcos PiauIsrael CampiottiRodrigo NogueiraRoberto Lotufo
2020-08-20
Lite Training Strategies for Portuguese-English and English-Portuguese Translation
Alexandre LopesRodrigo NogueiraRoberto LotufoHelio Pedrini
2020-08-20
Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study
Karthik GopalakrishnanBehnam HedayatniaLongshaokan WangYang LiuDilek Hakkani-Tur
2020-08-18
Estimation of causal effects of multiple treatments in healthcare database studies with rare outcomes
Liangyuan HuChenyang Gu
2020-08-18
Very Deep Transformers for Neural Machine Translation
Xiaodong LiuKevin DuhLiyuan LiuJianfeng Gao
2020-08-18
Glancing Transformer for Non-Autoregressive Neural Machine Translation
Lihua QianHao ZhouYu BaoMingxuan WangLin QiuWeinan ZhangYong YuLei Li
2020-08-18
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Dara BahriYi TayChe ZhengDonald MetzlerCliff BrunkAndrew Tomkins
2020-08-17
Spatial Temporal Transformer Network for Skeleton-based Action Recognition
| Chiara PlizzariMarco CanniciMatteo Matteucci
2020-08-17
Narrative Interpolation for Generating and Understanding Stories
Su WangGreg DurrettKatrin Erk
2020-08-17
TopicBERT: A Transformer transfer learning based memory-graph approach for multimodal streaming social media topic detection
Meysam Asgari-ChenaghluMohammad-Reza Feizi-DerakhshiLeili farzinvashMohammad-Ali BalafarCina Motamed
2020-08-16
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification
Libo QinWanxiang CheYangming LiMinheng NiTing Liu
2020-08-16
Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Davis YoshidaAllyson EttingerKevin Gimpel
2020-08-16
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry TsaiJayden OoiChun-Sung FerngHyung Won ChungJason Riesa
2020-08-15
A Hybrid BERT and LightGBM based Model for Predicting Emotion GIF Categories on Twitter
Ye BiShuo WangZhongrui Fan
2020-08-14
Adaptable Multi-Domain Language Model for Transformer ASR
Taewoo LeeMin-Joong LeeTae Gyoon KangSeokyeoung JungMinseok KwonYeona HongJungin LeeKyoung-Gu WooHo-Gyeong KimJiseung JeongJihyun LeeHosik LeeYoung Sang Choi
2020-08-14
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea Madotto
2020-08-14
MMM : Exploring Conditional Multi-Track Music Generation with the Transformer
Jeff EnsPhilippe Pasquier
2020-08-13
Large-scale Transfer Learning for Low-resource Spoken Language Understanding
Xueli JiaJianzong WangZhiyong ZhangNing ChengJing Xiao
2020-08-13
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition
Wenyong HuangWenchao HuYu Ting YeungXiao Chen
2020-08-13
End-to-end Contextual Perception and Prediction with Interaction Transformer
Lingyun Luke LiBin YangMing LiangWenyuan ZengMengye RenSean SegalRaquel Urtasun
2020-08-13
Evaluating the Impact of Knowledge Graph Context on Entity Disambiguation Models
| Isaiah Onando Mulang'Kuldeep SinghChaitali PrabhuAbhishek NadgeriJohannes HoffartJens Lehmann
2020-08-12
Compression of Deep Learning Models for Text: A Survey
Manish GuptaPuneet Agrawal
2020-08-12
Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders
| Nicola MessinaGiuseppe AmatoAndrea EsuliFabrizio FalchiClaudio GennaroStéphane Marchand-Maillet
2020-08-12
KR-BERT: A Small-Scale Korean-Specific Language Model
Sangah LeeHansol JangYunmee BaikSuzi ParkHyopil Shin
2020-08-10
Navigating Language Models with Synthetic Agents
Philip Feldman
2020-08-10
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin HuangTomoki HayashiYi-Chiao WuHirokazu KameokaTomoki Toda
2020-08-07
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
| Patrick LewisPontus StenetorpSebastian Riedel
2020-08-06
6VecLM: Language Modeling in Vector Space for IPv6 Target Generation
Tianyu CuiGang XiongGaopeng GouJunzheng ShiWei Xia
2020-08-05
The Jazz Transformer on the Front Line: Exploring the Shortcomings of AI-composed Music through Quantitative Measures
| Shih-Lun WuYi-Hsuan Yang
2020-08-04
[email protected] at SemEval-2020 Task 12: Multilingual or language-specific BERT?
Marc PàmiesEmily ÖhmanKaisla KajavaJörg Tiedemann
2020-08-03
Self-attention encoding and pooling for speaker recognition
Pooyan SafariMiquel IndiaJavier Hernando
2020-08-03
The Chess Transformer: Mastering Play using Generative Language Models
David NoeverMatt CiolinoJosh Kalin
2020-08-02
SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space
Liu YangFanqi MengMing-Kuang Daniel WuVicent YingXianchao Xu
2020-08-02
Trojaning Language Models for Fun and Profit
Xinyang ZhangZheng ZhangTing Wang
2020-08-01
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang LinXin LiGennady Pekhimenko
2020-08-01
On Learning Universal Representations Across Languages
Xiangpeng WeiYue HuRongxiang WengLuxi XingHeng YuWeihua Luo
2020-07-31
TweepFake: about Detecting Deepfake Tweets
Tiziano FagniFabrizio FalchiMargherita GambiniAntonio MartellaMaurizio Tesconi
2020-07-31
Deep Multi-View Spatiotemporal Virtual Graph Neural Network for Significant Citywide Ride-hailing Demand Prediction
Guangyin JinZhexu XiHengyu ShaYanghe FengJincai Huang
2020-07-30
Interpretable Contextual Team-aware Item Recommendation: Application in Multiplayer Online Battle Arena Games
| Andrés VillaVladimir AraujoFrancisca CattanDenis Parra
2020-07-30
Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining
TJ TsaiKevin Ji
2020-07-29
TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling
Shuai ZhangPeng ZhangXindian MaJunqiu Weiningning WangQun Liu
2020-07-28
To BERT or Not To BERT: Comparing Speech and Language-based Approaches for Alzheimer's Disease Detection
Aparna BalagopalanBenjamin EyreFrank RudziczJekaterina Novikova
2020-07-26
FiSSA at SemEval-2020 Task 9: Fine-tuned For Feelings
| Bertelt BraaksmaRichard ScholtensStan van SuijlekomRemy WangAhmet Üstün
2020-07-24
Exploring Swedish & English fastText Embeddings with the Transformer
| Tosin P. AdewumiFoteini LiwickiMarcus Liwicki
2020-07-23
CrossTransformers: spatially-aware few-shot transfer
Carl DoerschAnkush GuptaAndrew Zisserman
2020-07-22
Analogical Reasoning for Visually Grounded Language Acquisition
Bo WuHaoyu QinAlireza ZareianCarl VondrickShih-Fu Chang
2020-07-22
SliceOut: Training Transformers and CNNs faster while using less memory
Pascal NotinAidan N. GomezJoanna YooYarin Gal
2020-07-21
Neural Machine Translation with Error Correction
Kaitao SongXu TanJianfeng Lu
2020-07-21
Learning Joint Spatial-Temporal Transformations for Video Inpainting
| Yanhong ZengJianlong FuHongyang Chao
2020-07-20
Conformer-Kernel with Query Term Independence for Document Retrieval
| Bhaskar MitraSebastian HofstatterHamed ZamaniNick Craswell
2020-07-20
Temporal Pointwise Convolutional Networks for Length of Stay Prediction in the Intensive Care Unit
| Emma RocheteauPietro LiòStephanie Hyland
2020-07-18
Feature Pyramid Transformer
| Dong ZhangHanwang ZhangJinhui TangMeng WangXiansheng HuaQianru Sun
2020-07-18
Generative Pretraining from Pixels
| Mark ChenAlec RadfordRewon ChildJeff WuHeewoo JunPrafulla DhariwalDavid LuanIlya Sutskever
2020-07-17
Deep Learning Based Traffic Surveillance System For Missing and Suspicious Car Detection
K. V. KadambariVishnu Vardhan Nimmalapudi
2020-07-17
CTC-Segmentation of Large Corpora for German End-to-end Speech Recognition
Ludwig KürzingerDominik WinkelbauerLujun LiTobias WatzelGerhard Rigoll
2020-07-17
Investigating Pretrained Language Models for Graph-to-Text Generation
Leonardo F. R. RibeiroMartin SchmittHinrich SchützeIryna Gurevych
2020-07-16
Hopfield Networks is All You Need
| Hubert RamsauerBernhard SchäflJohannes LehnerPhilipp SeidlMichael WidrichLukas GruberMarkus HolzleitnerMilena PavlovićGeir Kjetil SandveVictor GreiffDavid KreilMichael KoppGünter KlambauerJohannes BrandstetterSepp Hochreiter
2020-07-16
InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training
| Zewen ChiLi DongFuru WeiNan YangSaksham SinghalWenhui WangXia SongXian-Ling MaoHeyan HuangMing Zhou
2020-07-15
The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction
Alice MartinCharles OllionFlorian StrubSylvain Le CorffOlivier Pietquin
2020-07-15
Deep Transformer based Data Augmentation with Subword Units for Morphologically Rich Online ASR
Balázs TarjánGyörgy SzaszákTibor FegyóPéter Mihajlik
2020-07-14
Contextualized Code Representation Learning for Commit Message Generation
Lun Yiu NieCuiyun GaoZhicong ZhongWai LamYang LiuZenglin Xu
2020-07-14
Emoji Prediction: Extensions and Benchmarking
Weicheng MaRuibo LiuLili WangSoroush Vosoughi
2020-07-14
Paranoid Transformer: Reading Narrative of Madness as Computational Approach to Creativity
Yana AgafonovaAlexey TikhonovIvan P. Yamshchikov
2020-07-13
Transformer with Depth-Wise LSTM
Hongfei XuQiuhui LiuDeyi XiongJosef van Genabith
2020-07-13
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation
Aditya MogadalaMarius MosbachDietrich Klakow
2020-07-12
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
| Andy T. LiuShang-Wen LiHung-yi Lee
2020-07-12
HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Yi TayZhe ZhaoDara BahriDonald MetzlerDa-Cheng Juan
2020-07-12
Sequence Generation with Mixed Representations
Lijun Wu Shufang Xie Yingce Xia Fan Yang Tao Qin Jianhuang Lai Tie-Yan Liu
2020-07-11
BISON:BM25-weighted Self-Attention Framework for Multi-Fields Document Search
| Xuan ShanChuanjie LiuYiqian XiaQi ChenYusi ZhangAngen LuoYuxiang Luo
2020-07-10
DeepSinger: Singing Voice Synthesis with Data Mined From the Web
Yi RenXu TanTao QinJian LuanZhou ZhaoTie-Yan Liu
2020-07-09
Advances of Transformer-Based Models for News Headline Generation
| Alexey BukhtiyarovIlya Gusev
2020-07-09
The Go Transformer: Natural Language Modeling for Game Play
Matthew CiolinoDavid NoeverJosh Kalin
2020-07-07
Do Transformers Need Deep Long-Range Memory
Jack W. RaeAli Razavi
2020-07-07
Relevance Transformer: Generating Concise Code Snippets with Relevance Feedback
Carlos GemmellFederico RossettoJeffrey Dalton
2020-07-06
Learning to Segment Anatomical Structures Accurately from One Exemplar
Yuhang LuWeijian LiKang ZhengYirui WangAdam P. HarrisonChihung LinSong WangJing XiaoLe LuChang-Fu KuoShun Miao
2020-07-06
You Autocomplete Me: Poisoning Vulnerabilities in Neural Code Completion
Roei SchusterCongzheng SongEran TromerVitaly Shmatikov
2020-07-05
Abstractive and mixed summarization for long-single documents
Roger BarrullJugal Kalita
2020-07-03
On-The-Fly Information Retrieval Augmentation for Language Models
Hai WangDavid McAllester
2020-07-03
Self-Attention Guided Copy Mechanism for Abstractive Summarization
Song XuHaoran LiPeng YuanYouzheng WuXiaodong HeBowen Zhou
2020-07-01
Multimodal and Multiresolution Speech Recognition with Transformers
Georgios ParaskevopoulosSrinivas ParthasarathyAparna KhareShiva Sundaram
2020-07-01
Roles and Utilization of Attention Heads in Transformer-based Neural Language Models
Jae-young JoSung-Hyon Myaeng
2020-07-01
Multimodal Transformer for Multimodal Machine Translation
Shaowei YaoXiaojun Wan
2020-07-01
Paraphrase Generation by Learning How to Edit from Samples
Amirhossein KazemnejadMohammadreza SalehiMahdieh Soleymani Baghshah
2020-07-01
Dependency Graph Enhanced Dual-transformer Structure for Aspect-based Sentiment Classification
Hao TangDonghong JiChenliang LiQiji Zhou
2020-07-01
In Neural Machine Translation, What Does Transfer Learning Transfer?
Alham Fikri AjiNikolay BogoychevKenneth HeafieldRico Sennrich
2020-07-01
Feature Projection for Improved Text Classification
Qi QinWenpeng HuBing Liu
2020-07-01
Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation
Arya D. McCarthyXian LiJiatao GuNing Dong
2020-07-01
DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation
| Yizhe ZhangSiqi SunMichel GalleyYen-Chun ChenChris BrockettXiang GaoJianfeng GaoJingjing LiuBill Dolan
2020-07-01
Combining Subword Representations into Word-level Representations in the Transformer Architecture
Noe CasasMarta R. Costa-juss{\`a}Jos{\'e} A. R. Fonollosa
2020-07-01
Robust Neural Machine Translation with ASR Errors
Haiyang XueYang FengShuhao GuWei Chen
2020-07-01
An empirical investigation of neural methods for content scoring of science explanations
Brian RiordanSarah BichlerAllison BradfordJennifer King ChenKorah WileyLibby GerardMarcia C. Linn
2020-07-01
Neural Transduction of Letter Position Dyslexia using an Anagram Matrix Representation
Avi Bleiweiss
2020-07-01
Character aware models with similarity learning for metaphor detection
Tarun KumarYashvardhan Sharma
2020-07-01
A Transformer Approach to Contextual Sarcasm Detection in Twitter
Hunter GregorySteven LiPouya MohammadiNatalie TarnRachel DraelosCynthia Rudin
2020-07-01
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal Dependency Parsing
Han HeJinho D. Choi
2020-07-01
KIT's IWSLT 2020 SLT Translation System
Ngoc-Quan PhamFelix SchneiderTuan-Nam NguyenThanh-Le HaThai Son NguyenMaximilian AwiszusSebastian St{\"u}kerAlex Waibeler
2020-07-01
End-to-End Simultaneous Translation System for IWSLT2020 Using Modality Agnostic Meta-Learning
Hou Jeung HanMohd Abbas ZaidiSathish Reddy IndurthiNikhil Kumar LakumarapuBeomseok LeeSangha Kim
2020-07-01
End-to-End Offline Speech Translation System for IWSLT 2020 using Modality Agnostic Meta-Learning
Nikhil Kumar LakumarapuBeomseok LeeSathish Reddy IndurthiHou Jeung HanMohd Abbas ZaidiSangha Kim
2020-07-01
SRPOL's System for the IWSLT 2020 End-to-End Speech Translation Task
Tomasz PotapczykPawel Przybysz
2020-07-01
The AFRL IWSLT 2020 Systems: Work-From-Home Edition
Brian OreEric HansenTim AndersonJeremy Gwinnup
2020-07-01
OPPO's Machine Translation System for the IWSLT 2020 Open Domain Translation Task
Qian ZhangXiaopu LiDawei DangTingxun ShiDi AiZhengshan XueJie Hao
2020-07-01
CASIA's System for IWSLT 2020 Open Domain Translation
Qian WangYuchen LiuCong MaYu LuYining WangLong ZhouYang ZhaoJiajun ZhangChengqing Zong
2020-07-01
Deep Blue Sonics' Submission to IWSLT 2020 Open Domain Translation Task
Enmin SuYi Ren
2020-07-01
University of Tsukuba's Machine Translation System for IWSLT20 Open Domain Translation Task
Hongyi CuiYizhen WeiShohei IidaTakehito UtsuroMasaaki Nagata
2020-07-01
Xiaomi's Submissions for IWSLT 2020 Open Domain Translation Task
Yuhui SunMengxue GuoXiang LiJianwei CuiBin Wang
2020-07-01
The HW-TSC Video Speech Translation System at IWSLT 2020
Minghan WangHao YangYao DengYing QinLizhi LeiDaimeng WeiHengchao ShangNing XieXiaochun LiJiaxian Guo
2020-07-01
Towards Stream Translation: Adaptive Computation Time for Simultaneous Machine Translation
Felix SchneiderAlex Waibeler
2020-07-01
Compressing Neural Machine Translation Models with 4-bit Precision
Alham Fikri AjiKenneth Heafield
2020-07-01
Training and Inference Methods for High-Coverage Neural Machine Translation
Michael YangYixin LiuRahul Mayuranath
2020-07-01
Expand and Filter: CUNI and LMU Systems for the WNGT 2020 Duolingo Shared Task
Jind{\v{r}}ich Libovick{\'y}Zden{\v{e}}k KasnerJind{\v{r}}ich HelclOnd{\v{r}}ej Du{\v{s}}ek
2020-07-01
The NiuTrans System for WNGT 2020 Efficiency Task
Chi HuBei LiYinqiao LiYe LinYanyang LiChenglong WangTong XiaoJingbo Zhu
2020-07-01
Efficient and High-Quality Neural Machine Translation with OpenNMT
Guillaume KleinDakun ZhangCl{\'e}ment ChouteauJosep CregoJean Senellart
2020-07-01
Improving Document-Level Neural Machine Translation with Domain Adaptation
Sami Ul HaqSadaf Abdul RaufArslan ShoukatNoor-e- Hira
2020-07-01
CopyBERT: A Unified Approach to Question Generation with Self-Attention
Stalin VaranasiSaadullah AminGuenter Neumann
2020-07-01
How to Tame Your Data: Data Augmentation for Dialog State Tracking
Adam SummervilleJordan HashemiJames Ryanwilliam ferguson
2020-07-01
Methods for Extracting Information from Messages from Primary Care Providers to Specialists
Xiyu DingMichael BarnettAteev MehrotraTimothy Miller
2020-07-01
Generating Medical Reports from Patient-Doctor Conversations Using Sequence-to-Sequence Models
Seppo EnarviMarilisa AmoiaMiguel Del-Agua TebaBrian DelaneyFrank DiehlStefan HahnKristina HarrisLiam McGrathYue PanJoel PintoLuca RubiniMiguel RuizGag SingheepFabian StemmerWeiyi SunPaul VozilaThomas LinRanjani Ramamurthy
2020-07-01
Enhancing Transformer with Sememe Knowledge
Yuhui ZhangChenghao YangZhengping ZhouZhiyuan Liu
2020-07-01
Grapheme-to-Phoneme Conversion with a Multilingual Transformer Model
Omnia ElSaadanyBenjamin Suter
2020-07-01
Frustratingly Easy Multilingual Grapheme-to-Phoneme Conversion
Nikhil PrabhuKatharina Kann
2020-07-01
Leveraging Principal Parts for Morphological Inflection
Ling LiuMans Hulden
2020-07-01
Data Augmentation for Transformer-based G2P
Zach RyanMans Hulden
2020-07-01
HausaMT v1.0: Towards English--Hausa Neural Machine Translation
Adewale Akinfaderin
2020-07-01
An Evaluation of Subword Segmentation Strategies for Neural Machine Translation of Morphologically Rich Languages
Aquia RichburgEskRamy erSmar MuresanaMarine Carpuat
2020-07-01
On-The-Fly Information Retrieval Augmentation for Language Models
Hai WangDavid McAllester
2020-07-01
Integrating Multimodal Information in Large Pretrained Transformers
Wasifur RahmanMd Kamrul HasanSangwu LeeAmirAli Bagher ZadehChengfeng MaoLouis-Philippe MorencyEhsan Hoque
2020-07-01
Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation
Bo PangErik NijkampWenjuan HanLinqi ZhouYixian LiuKewei Tu
2020-07-01
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining
Ivana Kvapil{\'\i}kov{\'a}Mikel ArtetxeGorka LabakaEneko AgirreOnd{\v{r}}ej Bojar
2020-07-01
Detecting Sarcasm in Conversation Context Using Transformer-Based Models
Adithya AvvaruSanath VobilisettyRadhika Mamidi
2020-07-01
Metaphor Detection Using Contextual Word Embeddings From Transformers
Jerry LiuNathan O{'}HaraAlex RubinerRachel DraelosCynthia Rudin
2020-07-01
POSTECH Submission on Duolingo Shared Task
Junsu ParkHongseok KwonJong-Hyeok Lee
2020-07-01
LSTM and GPT-2 Synthetic Speech Transfer Learning for Speaker Recognition to Overcome Data Scarcity
Jordan J. BirdDiego R. FariaAnikó EkártCristiano PremebidaPedro P. S. Ayrosa
2020-07-01
The Summary Loop: Learning to Write Abstractive Summaries Without Examples
| Philippe LabanAndrew HsiJohn CannyMarti A. Hearst
2020-07-01
Image-level Harmonization of Multi-Site Data using Image-and-Spatial Transformer Networks
| R. RobinsonQ. DouD. C. CastroK. KamnitsasM. de GrootR. M. SummersD. RueckertB. Glocker
2020-06-30
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
| Dmitry LepikhinHyoukJoong LeeYuanzhong XuDehao ChenOrhan FiratYanping HuangMaxim KrikunNoam ShazeerZhifeng Chen
2020-06-30
Correction of Faulty Background Knowledge based on Condition Aware and Revise Transformer for Question Answering
Xinyan ZhaoXiao FengHaoming ZhongJun YaoHuanhuan Chen
2020-06-30
BERTERS: Multimodal Representation Learning for Expert Recommendation System with Transformer
N. Nikzad-KhasmakhiM. A. BalafarM. Reza Feizi-DerakhshiCina Motamed
2020-06-30
Simplifying Models with Unlabeled Output Data
Sang Michael XieTengyu MaPercy Liang
2020-06-29
Predicting Length of Stay in the Intensive Care Unit with Temporal Pointwise Convolutional Networks
| Emma RocheteauPietro LiòStephanie Hyland
2020-06-29
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
| Jean-Benoit DelbrouckNoé TitsMathilde BrousmicheStéphane Dupont
2020-06-29
Multi-Head Attention: Collaborate Instead of Concatenate
| Jean-Baptiste CordonnierAndreas LoukasMartin Jaggi
2020-06-29
Knowledge-Aware Language Model Pretraining
Corby RossetChenyan XiongMinh PhanXia SongPaul BennettSaurabh Tiwary
2020-06-29
Interpreting Hierarchical Linguistic Interactions in DNNs
Die ZhangHuilin ZhouXiaoyi BaoDa HuoRuizhao ChenXu ChengHao ZhangMengyue WuQuanshi Zhang
2020-06-29
Rethinking Positional Encoding in Language Pre-training
| Guolin KeDi HeTie-Yan Liu
2020-06-28
Self-Attention Networks for Intent Detection
Sevinj YolchuyevaGéza NémethBálint Gyires-Tóth
2020-06-28
Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates
| Ke SunZigang GengDepu MengBin XiaoDong LiuZhaoxiang ZhangJingdong Wang
2020-06-28
Progressive Generation of Long Text
| Bowen TanZichao YangMaruan AI-ShedivatEric P. XingZhiting Hu
2020-06-28
Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization
Beliz GunelChenguang ZhuMichael ZengXuedong Huang
2020-06-27
Video-Grounded Dialogues with Pretrained Generation Language Models
Hung LeSteven C. H. Hoi
2020-06-27
Normalizador Neural de Datas e Endereços
Gustavo PlensackPaulo Finardi
2020-06-27
What they do when in doubt: a study of inductive biases in seq2seq learners
| Eugene KharitonovRahma Chaabouni
2020-06-26
TURL: Table Understanding through Representation Learning
| Xiang DengHuan SunAlyssa LeesYou WuCong Yu
2020-06-26
BERTology Meets Biology: Interpreting Attention in Protein Language Models
| Jesse VigAli MadaniLav R. VarshneyCaiming XiongRichard SocherNazneen Fatema Rajani
2020-06-26
Conditional Set Generation with Transformers
Adam R KosiorekHyunjik KimDanilo J Rezende
2020-06-26
Streaming Transformer ASR with Blockwise Synchronous Inference
Emiru TsunooYosuke KashiwagiShinji Watanabe
2020-06-25
Learning Source Phrase Representations for Neural Machine Translation
Hongfei XuJosef van GenabithDeyi XiongQiuhui LiuJingyi Zhang
2020-06-25
Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering
Chiranjib Sur
2020-06-25
SACT: Self-Aware Multi-Space Feature Composition Transformer for Multinomial Attention for Video Captioning
Chiranjib Sur
2020-06-25
Differentiable Window for Dynamic Local Attention
Thanh-Tung NguyenXuan-Phi NguyenShafiq JotyXiaoli Li
2020-06-24
Hybrid Spatio-Temporal Graph Convolutional Network: Improving Traffic Prediction with Navigation Data
| Rui DaiShenkun XuQian GuChenguang JiKaikui Liu
2020-06-23
Bach or Mock? A Grading Function for Chorales in the Style of J.S. Bach
| Alexander FangAlisa LiuPrem SeetharamanBryan Pardo
2020-06-23
Self-supervised edge features for improved Graph Neural Network training
| Arijit SehanobishNeal G. RavindraDavid van Dijk
2020-06-23
A Self-Attention Network based Node Embedding Model
Dai Quoc NguyenTu Dinh NguyenDinh Phung
2020-06-22
Exploring Software Naturalness through Neural Language Models
Luca BurattiSaurabh PujarMihaela BorneaScott McCarleyYunhui ZhengGaetano RossielloAlessandro MorariJim LaredoVeronika ThostYufan ZhuangGiacomo Domeniconi
2020-06-22
AdvAug: Robust Adversarial Augmentation for Neural Machine Translation
Yong ChengLu JiangWolfgang MachereyJacob Eisenstein
2020-06-21
The NYU-CUBoulder Systems for SIGMORPHON 2020 Task 0 and Task 2
Assaf SingerKatharina Kann
2020-06-21
Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation
Shiyang YanYang HuaNeil M. Robertson
2020-06-21
A Universal Representation Transformer Layer for Few-Shot Image Classification
| Lu LiuWilliam HamiltonGuodong LongJing JiangHugo Larochelle
2020-06-21
Memory Transformer
Mikhail S. BurtsevGrigory V. Sapunov
2020-06-20
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
| Alexei BaevskiHenry ZhouAbdelrahman MohamedMichael Auli
2020-06-20
End-to-end deep metamodeling to calibrate and optimize energy loads
Max CohenMaurice CharbitSylvain Le CorffMarius PredaGilles Nozière
2020-06-19
A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19
| David OnianiYanshan Wang
2020-06-19
Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
Szu-Wei FuChien-Feng LiaoTsun-An HsiehKuo-Hsuan HungSyu-Siang WangCheng YuHeng-Cheng KuoRyandhimas E. ZezarioYou-Jin LiShang-Yi ChuangYen-Ju LuYu Tsao
2020-06-18
Multi-branch Attentive Transformer
| Yang FanShufang XieYingce XiaLijun WuTao QinXiang-Yang LiTie-Yan Liu
2020-06-18
I-BERT: Inductive Generalization of Transformer to Arbitrary Context Lengths
| Hyoungwook NamSeung Byum SeoVikram Sharma MailthodyNoor MichaelLan Li
2020-06-18
SEAL: Segment-wise Extractive-Abstractive Long-form Text Summarization
Yao ZhaoMohammad SalehPeter J. Liu
2020-06-18
Sparse GPU Kernels for Deep Learning
Trevor GaleMatei ZahariaCliff YoungErich Elsen
2020-06-18
SenWave: Monitoring the Global Sentiments under the COVID-19 Pandemic
Qiang YangHind AlamroSomayah AlbaradeiAdil SalhiXiaoting LvChangsheng MaManal AlshehriInji JaberFaroug TifrateneWei WangTakashi GojoboriCarlos M. DuarteXin GaoXiangliang Zhang
2020-06-18
Intelligent Protection & Classification of Transients in Two-Core Symmetric Phase Angle Regulating Transformers
Pallav Kumar BeraCan Isik
2020-06-17
Automatically Ranked Russian Paraphrase Corpus for Text Generation
Vadim GudkovOlga MitrofanovaElizaveta Filippskikh
2020-06-17
Learning Visual Commonsense for Robust Scene Graph Generation
Alireza ZareianZhecan WangHaoxuan YouShih-Fu Chang
2020-06-17
Cross-lingual Retrieval for Iterative Self-Supervised Training
| Chau TranYuqing TangXi-An LiJiatao Gu
2020-06-16
Modeling Graph Structure via Relative Position for Better Text Generation from Knowledge Graphs
Martin SchmittLeonardo F. R. RibeiroPhilipp DufterIryna GurevychHinrich Schütze
2020-06-16
Fine-grained Human Evaluation of Transformer and Recurrent Approaches to Neural Machine Translation for English-to-Chinese
| Yuying YeAntonio Toral
2020-06-15
On the Multi-Property Extraction and Beyond
Tomasz DwojakMichał PietruszkaŁukasz BorchmannFilip GralińskiJakub Chłędowski
2020-06-15
Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset
Andrei AndrusenkoAleksandr LaptevIvan Medennikov
2020-06-15
Differentiable Neural Architecture Transformation for Reproducible Architecture Improvement
Do-Guk KimHeung-Chang Lee
2020-06-15
Multi-Image Summarization: Textual Summary from a Set of Cohesive Images
Nicholas TrieuSebastian GoodmanPradyumna NarayanaKazoo SoneRadu Soricut
2020-06-15
Cooking Is All About People: Comment Classification On Cookery Channels Using BERT and Classification Models (Malayalam-English Mix-Code)
Subramaniam KazhuparambilAbhishek Kaushik
2020-06-15
Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya
Abrhalei TelaAbraham WoubieVille Hautamaki
2020-06-13
Guided Transformer: Leveraging Multiple External Sources for Representation Learning in Conversational Search
Helia HashemiHamed ZamaniW. Bruce Croft
2020-06-13
Temporal Fusion Network for Temporal Action Localization:Submission to ActivityNet Challenge 2020 (Task E)
Zhiwu QingXiang WangYongpeng SangChangxin GaoShiwei ZhangNong Sang
2020-06-13
Modelling High-Level Mathematical Reasoning in Mechanised Declarative Proofs
Wenda LiLei YuYuhuai WuLawrence C. Paulson
2020-06-13
Comparing Natural Language Processing Techniques for Alzheimer's Dementia Prediction in Spontaneous Speech
Thomas SearleZina IbrahimRichard Dobson
2020-06-12
Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences
| Marissa A. WeisKashyap ChittaYash SharmaWieland BrendelMatthias BethgeAndreas GeigerAlexander S. Ecker
2020-06-12
Implicit Kernel Attention
Kyungwoo SongYohan JungDongjun KimIl-Chul Moon
2020-06-11
Dance Revolution: Long Sequence Dance Generation with Music via Curriculum Learning
| Ruozi HuangHuang HuWei WuKei SawadaMi Zhang
2020-06-11
FastPitch: Parallel Text-to-speech with Pitch Prediction
Adrian Łańcucki
2020-06-11
Extrapolation for Large-batch Training in Deep Learning
Tao LinLingjing KongSebastian U. StichMartin Jaggi
2020-06-10
Graph-Aware Transformer: Is Attention All Graphs Need?
Sanghyun YooYoung-Seok KimKang Hyun LeeKuhwan JeongJunhwi ChoiHoshik LeeYoung Sang Choi
2020-06-09
HausaMT v1.0: Towards English-Hausa Neural Machine Translation
Adewale Akinfaderin
2020-06-09
Unsupervised Paraphrase Generation using Pre-trained Language Models
Chaitra HegdeShrikumar Patil
2020-06-09
Few-Shot Generative Conversational Query Rewriting
| Shi YuJiahua LiuJingqin YangChenyan XiongPaul BennettJianfeng GaoZhiyuan Liu
2020-06-09
Linformer: Self-Attention with Linear Complexity
| Sinong WangBelinda Z. LiMadian KhabsaHan FangHao Ma
2020-06-08
Modeling Discourse Structure for Document-level Neural Machine Translation
Junxuan ChenXiang LiJiarui ZhangChulun ZhouJianwei CuiBin WangJinsong Su
2020-06-08
MultiSpeech: Multi-Speaker Text to Speech with Transformer
Mingjian ChenXu TanYi RenJin XuHao SunSheng ZhaoTao QinTie-Yan Liu
2020-06-08
Learning to Count Words in Fluent Speech enables Online Speech Recognition
| George SterpuChristian SaamNaomi Harte
2020-06-08
Wat zei je? Detecting Out-of-Distribution Translations with Variational Transformers
Tim Z. XiaoAidan N. GomezYarin Gal
2020-06-08
Learning Texture Transformer Network for Image Super-Resolution
| Fuzhi YangHuan YangJianlong FuHongtao LuBaining Guo
2020-06-07
Challenges and Thrills of Legal Arguments
Anurag PallaproluRadha VaidyaAditya Swaroop Attawar
2020-06-06
Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
Krzysztof ChoromanskiValerii LikhosherstovDavid DohanXingyou SongJared DavisTamas SarlosDavid BelangerLucy ColwellAdrian Weller
2020-06-05
GMAT: Global Memory Augmentation for Transformers
| Ankit GuptaJonathan Berant
2020-06-05
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
| Zihang DaiGuokun LaiYiming YangQuoc V. Le
2020-06-05
An Overview of Neural Network Compression
James O' Neill
2020-06-05
End-to-End Speech-Translation with Knowledge Distillation: [email protected]
Marco GaidoMattia Antonino Di GangiMatteo NegriMarco Turchi
2020-06-04
Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2
| Virapat KieuvongngamBowen TanYiming Niu
2020-06-03
On the Predictive Power of Neural Language Models for Human Real-Time Comprehension Behavior
Ethan Gotlieb WilcoxJon GauthierJennifer HuPeng QianRoger Levy
2020-06-02
Subjective Question Answering: Deciphering the inner workings of Transformers in the realm of subjectivity
Lukas Muttenthaler
2020-06-02
Approche de g\'en\'eration de r\'eponse \`a base de transformers (Transformer based approach for answer generation)
Imen AkermiJohannes HeineckeFr{\'e}d{\'e}ric Herledan
2020-06-01
Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English
Maha ElbayadMichael UstaszewskiEmmanuelle Esperança-RodierFrancis Brunet ManquatLaurent Besacier
2020-06-01
Context-based Transformer Models for Answer Sentence Selection
Ivano LauriolaAlessandro Moschitti
2020-06-01
Unsupervised Sparse-view Backprojection via Convolutional and Spatial Transformer Networks
Xueqing LiuPaul Sajda
2020-06-01
Image Search With Text Feedback by Visiolinguistic Attention Learning
| Yanbei Chen Shaogang Gong Loris Bazzani
2020-06-01
Few-Shot Learning of Part-Specific Probability Space for 3D Shape Segmentation
Lingjing Wang Xiang Li Yi Fang
2020-06-01
RDCFace: Radial Distortion Correction for Face Recognition
He Zhao Xianghua Ying Yongjie Shi Xin Tong Jingsi Wen Hongbin Zha
2020-06-01
ActBERT: Learning Global-Local Video-Text Representations
Linchao Zhu Yi Yang
2020-06-01
Emergence of Separable Manifolds in Deep Language Representations
Jonathan MamouHang LeMiguel Del RioCory StephensonHanlin TangYoon KimSueYeon Chung
2020-06-01
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
| Zhewei YaoAmir GholamiSheng ShenKurt KeutzerMichael W. Mahoney
2020-06-01
BPGC at SemEval-2020 Task 11: Propaganda Detection in News Articles with Multi-Granularity Knowledge Sharing and Linguistic Features based Ensemble Learning
Rajaswa PatilSomesh SinghSwati Agarwal
2020-05-31
CNRL at SemEval-2020 Task 5: Modelling Causal Reasoning in Language with Multi-Head Self-Attention Weights based Counterfactual Detection
Rajaswa PatilVeeky Baths
2020-05-31
First Neural Conjecturing Datasets and Experiments
Josef UrbanJan Jakubův
2020-05-29
Using Large Pretrained Language Models for Answering User Queries from Product Specifications
Kalyani RoySmit ShahNithish PaiJaidam RamtejPrajit Prashant NadkarnJyotirmoy BanerjeePawan GoyalSurender Kumar
2020-05-29
A Comparative Study of Lexical Substitution Approaches based on Neural Language Models
Nikolay ArefyevBoris SheludkoAlexander PodolskiyAlexander Panchenko
2020-05-29
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
| Hanrui WangZhanghao WuZhijian LiuHan CaiLigeng ZhuChuang GanSong Han
2020-05-28
Variational Neural Machine Translation with Normalizing Flows
Hendra SetiawanMatthias SperberUdhay NallasamyMatthias Paulik
2020-05-28
Empirical Evaluation of Pretraining Strategies for Supervised Entity Linking
Thibault FévryNicholas FitzGeraldLivio Baldini SoaresTom Kwiatkowski
2020-05-28
Language Models are Few-Shot Learners
| Tom B. BrownBenjamin MannNick RyderMelanie SubbiahJared KaplanPrafulla DhariwalArvind NeelakantanPranav ShyamGirish SastryAmanda AskellSandhini AgarwalAriel Herbert-VossGretchen KruegerTom HenighanRewon ChildAditya RameshDaniel M. ZieglerJeffrey WuClemens WinterChristopher HesseMark ChenEric SiglerMateusz LitwinScott GrayBenjamin ChessJack ClarkChristopher BernerSam McCandlishAlec RadfordIlya SutskeverDario Amodei
2020-05-28
General-Purpose User Embeddings based on Mobile App Usage
| Junqi ZhangBing BaiYe LinJian LiangKun BaiFei Wang
2020-05-27
Permutation Matters: Anisotropic Convolutional Layer for Learning on Point Clouds
| Zhongpai GaoGuangtao ZhaiJunchi YanXiaokang Yang
2020-05-27
Insertion-Based Modeling for End-to-End Automatic Speech Recognition
Yuya FujitaShinji WatanabeMotoi OmachiXuankai Chan
2020-05-27
End-to-End Object Detection with Transformers
| Nicolas CarionFrancisco MassaGabriel SynnaeveNicolas UsunierAlexander KirillovSergey Zagoruyko
2020-05-26
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
| Kostiantyn OmelianchukVitaliy AtrasevychArtem ChernodubOleksandr Skurzhanskyi
2020-05-26
Guiding Symbolic Natural Language Grammar Induction via Transformer-Based Sequence Probabilities
Ben GoertzelAndres Suarez MadrigalGino Yu
2020-05-26
Pay Attention to What You Read: Non-recurrent Handwritten Text-Line Recognition
Lei KangPau RibaMarçal RusiñolAlicia FornésMauricio Villegas
2020-05-26
Deep Learning Models for Automatic Summarization
Pirmin Lemberger
2020-05-25
The Unreasonable Volatility of Neural Machine Translation Models
| Marzieh FadaeeChristof Monz
2020-05-25
Adversarial NLI for Factual Correctness in Text Summarisation Models
Mario BarrantesBenedikt HerudekRichard Wang
2020-05-24
Stronger Baselines for Grammatical Error Correction Using Pretrained Encoder-Decoder Model
Satoru KatsumataMamoru Komachi
2020-05-24
Devising Malware Characterstics using Transformers
Simra ShahidTanmay SinghYash SharmaKapil Sharma
2020-05-23
Character-level Transformer-based Neural Machine Translation
Nikolay BanarWalter DaelemansMike Kestemont
2020-05-22
A Generative Approach to Titling and Clustering Wikipedia Sections
Anjalie FieldSascha RotheSimon BaumgartnerCong YuAbe Ittycheriah
2020-05-22
Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection
Danni LiuGerasimos SpanakisJan Niehues
2020-05-22
Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media
Xiangjue DongChangmao LiJinho D. Choi
2020-05-22
Simplified Self-Attention for Transformer-based End-to-End Speech Recognition
Haoneng LuoShiliang ZhangMing LeiLei Xie
2020-05-21
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning
Zhiping ZengVan Tung PhamHaihua XuYerbolat KhassanovEng Siong ChngChongjia NiBin Ma
2020-05-21
Text-to-Text Pre-Training for Data-to-Text Tasks
| Mihir Kale
2020-05-21
Applying the Transformer to Character-level Transduction
Shijie WuRyan CotterellMans Hulden
2020-05-20
Relative Positional Encoding for Speech Recognition and Direct Translation
Ngoc-Quan PhamThanh-Le HaTuan-Nam NguyenThai-Son NguyenElizabeth SaleskySebastian StuekerJan NiehuesAlexander Waibel
2020-05-20
Rethinking Performance Estimation in Neural Architecture Search
| Xiawu ZhengRongrong JiQiang WangQixiang YeZhenguo LiYonghong TianQi Tian
2020-05-20
A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Dongwei JiangWubo LiRuixiong ZhangMiao CaoNe LuoYang HanWei ZouXiangang Li
2020-05-20
Creative Artificial Intelligence -- Algorithms vs. humans in an incentivized writing competition
Nils KöbisLuca Mossink
2020-05-20
Investigations on Phoneme-Based End-To-End Speech Recognition
Albert ZeyerWei ZhouThomas NgRalf SchlüterHermann Ney
2020-05-19
Comparing Transformers and RNNs on predicting human sentence processing data
Danny MerkxStefan L. Frank
2020-05-19
Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
| George SterpuChristian SaamNaomi Harte
2020-05-19
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Hangyu LinYanwei FuYu-Gang JiangXiangyang Xue
2020-05-19
Exploring Transformers for Large-Scale Speech Recognition
Liang LuChangliang LiuJinyu LiYifan Gong
2020-05-19
A Transformer-based Embedding Model for Personalized Product Search
Keping BiQingyao AiW. Bruce Croft
2020-05-18
Efficient Wait-k Models for Simultaneous Machine Translation
Maha ElbayadLaurent BesacierJakob Verbeek
2020-05-18
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun YuXiao MaJiawei RenHaiyu ZhaoShuai Yi
2020-05-18
Many-to-Many Voice Transformer Network
Hirokazu KameokaWen-Chin HuangKou TanakaTakuhiro KanekoNobukatsu HojoTomoki Toda
2020-05-18
GPT-too: A language-model-first approach for AMR-to-text generation
| Manuel MagerRamon Fernandez AstudilloTahira NaseemMd Arafat SultanYoung-Suk LeeRadu FlorianSalim Roukos
2020-05-18
Weak-Attention Suppression For Transformer Based Speech Recognition
Yangyang ShiYongqiang WangChunyang WuChristian FuegenFrank ZhangDuc LeChing-Feng YehMichael L. Seltzer
2020-05-18
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke HiguchiShinji WatanabeNanxin ChenTetsuji OgawaTetsunori Kobayashi
2020-05-18
Building a Hebrew Semantic Role Labeling Lexical Resource from Parallel Movie Subtitles
Ben EyalMichael Elhadad
2020-05-17
Conformer: Convolution-augmented Transformer for Speech Recognition
| Anmol GulatiJames QinChung-Cheng ChiuNiki ParmarYu ZhangJiahui YuWei HanShibo WangZhengdong ZhangYonghui WuRuoming Pang
2020-05-16
Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension
Hongyu GongYelong ShenDian YuJianshu ChenDong Yu
2020-05-16
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory
Chunyang WuYongqiang WangYangyang ShiChing-Feng YehFrank Zhang
2020-05-16
IntelliCode Compose: Code Generation Using Transformer
Alexey SvyatkovskiyShao Kun DengShengyu FuNeel Sundaresan
2020-05-16
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
Zhengkun TianJiangyan YiJianhua TaoYe BaiShuai ZhangZhengqi Wen
2020-05-16
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter
| Martin MüllerMarcel SalathéPer E Kummervold
2020-05-15
Neural Entity Linking on Technical Service Tickets
Nadja KurzFelix HamannAdrian Ulges
2020-05-15
Finding Experts in Transformer Models
Xavier SuauLuca ZappellaNicholas Apostoloff
2020-05-15
JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment
Dan LimWon JangGyeonghwan OHyeyeong ParkBongwan KimJesam Yoon
2020-05-15
The Unstoppable Rise of Computational Linguistics in Deep Learning
James Henderson
2020-05-13
Large Scale Multi-Actor Generative Dialog Modeling
Alex BoydRaul PuriMohammad ShoeybiMostofa PatwaryBryan Catanzaro
2020-05-13
Discriminative Multi-modality Speech Recognition
| Bo XuCheng LuYandong GuoJacob Wang
2020-05-12
Simultaneous paraphrasing and translation by fine-tuning Transformer models
Rakesh Chada
2020-05-12
SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model
Baolin PengChunyuan LiJinchao LiShahin ShayandehLars LidenJianfeng Gao
2020-05-11
Hierarchical Attention Transformer Architecture For Syntactic Spell Correction
Abhishek NiranjanM Ali Basha ShaikKushal Verma
2020-05-11
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
Ye BaiJiangyan YiJianhua TaoZhengkun TianZhengqi WenShuai Zhang
2020-05-11
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
| Jie LeiLiwei WangYelong ShenDong YuTamara L. BergMohit Bansal
2020-05-11
On the Generation of Medical Dialogues for COVID-19
| Wenmian YangGuangtao ZengBowen TanZeqian JuSubrato ChakravortyXuehai HeShu ChenXingyi YangQingyang WuZhou YuEric XingPengtao Xie
2020-05-11
Epipolar Transformers
| Yihui HeRui YanKaterina FragkiadakiShoou-I Yu
2020-05-10
Transformer Based Language Models for Similar Text Retrieval and Ranking
Javed Qadrud-DinAshraf Bah RabiouRyan WalkerRavi SoniMartin GajekGabriel PackAkhil Rangaraj
2020-05-10
SocialTrans: A Deep Sequential Model with Social Information for Web-Scale Recommendation Systems
Qiaoan ChenHao GuLingling YiYishi LinPeng HeChuan ChenYangqiu Song
2020-05-09
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
| Samson TanShafiq JotyMin-Yen KanRichard Socher
2020-05-09
schuBERT: Optimizing Elements of BERT
Ashish KhetanZohar Karnin
2020-05-09
Character Matters: Video Story Understanding with Character-Aware Relations
Shijie GengJi ZhangZuohui FuPeng GaoHang ZhangGerard de Melo
2020-05-09
Mapping Natural Language Instructions to Mobile UI Action Sequences
| Yang LiJiacong HeXin ZhouYuan ZhangJason Baldridge
2020-05-07
A Systematic Assessment of Syntactic Generalization in Neural Language Models
Jennifer HuJon GauthierPeng QianEthan WilcoxRoger P. Levy
2020-05-07
An Empirical Study of Multi-Task Learning on BERT for Biomedical Text Mining
Yifan PengQingyu ChenZhiyong Lu
2020-05-06
The Cascade Transformer: an Application for Efficient Answer Sentence Selection
| Luca SoldainiAlessandro Moschitti
2020-05-05
Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change
Hongfei XuJosef van GenabithDeyi XiongQiuhui Liu
2020-05-05
OpinionDigest: A Simple Framework for Opinion Summarization
Yoshihiko SuharaXiaolan WangStefanos AngelidisWang-Chiew Tan
2020-05-05
ImpactCite: An XLNet-based method for Citation Impact Analysis
Dominique MercierSyed Tahseen Raza RizviVikas RajashekarAndreas DengelSheraz Ahmed
2020-05-05
Distributional Discrepancy: A Metric for Unconditional Text Generation
| Ping CaiXingyuan ChenPeng JinHongjun WangTianrui Li
2020-05-04
Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words
Josef KlafkaAllyson Ettinger
2020-05-04
Successfully Applying the Stabilized Lottery Ticket Hypothesis to the Transformer Architecture
Christopher BrixParnia BaharHermann Ney
2020-05-04
An Accurate Model for Predicting the (Graded) Effect of Context in Word Similarity Based on Bert
Wei BaoHongshu CheJiandong Zhang
2020-05-03
Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints
Zhenyi WangXiaoyang WangBang AnDong YuChangyou Chen
2020-05-03
Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation
| Xuanli HeGholamreza HaffariMohammad Norouzi
2020-05-03
Transformer-based End-to-End Question Generation
| Luis Enrico LopezDiane Kathryn CruzJan Christian Blaise CruzCharibeth Cheng
2020-05-03
Quantifying Attention Flow in Transformers
| Samira AbnarWillem Zuidema
2020-05-02
Measuring and Reducing Non-Multifact Reasoning in Multi-hop Question Answering
Harsh TrivediNiranjan BalasubramanianTushar KhotAshish Sabharwal
2020-05-02
Synthesizer: Rethinking Self-Attention in Transformer Models
Yi TayDara BahriDonald MetzlerDa-Cheng JuanZhe ZhaoChe Zheng
2020-05-02
Hard-Coded Gaussian Attention for Neural Machine Translation
Weiqiu YouSimeng SunMohit Iyyer
2020-05-02
Contrastive Self-Supervised Learning for Commonsense Reasoning
| Tassilo KleinMoin Nabi
2020-05-02
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
| Qingqing CaoHarsh TrivediAruna BalasubramanianNiranjan Balasubramanian
2020-05-02
A Simple Language Model for Task-Oriented Dialogue
| Ehsan Hosseini-AslBryan McCannChien-Sheng WuSemih YavuzRichard Socher
2020-05-02
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps
| Tri DaoNimit SohoniAlbert GuMatthew EichhornAmit BlonderMegan LeszczynskiAtri RudraChristopher Ré
2020-05-01
Global Relational Models of Source Code
Vincent J. HellendoornCharles SuttonRishabh SinghPetros ManiatisDavid Bieber
2020-05-01
Logic and the 2-Simplicial Transformer
| James CliftDmitry DorynDaniel MurfetJames Wallbridge
2020-05-01
Controllable Sentence Simplification
| Louis Martin{\'E}ric de la ClergerieBeno{\^\i}t SagotAntoine Bordes
2020-05-01
The AVA-Kinetics Localized Human Actions Video Dataset
Ang LiMeghana ThotakuriDavid A. RossJoão CarreiraAlexander VostrikovAndrew Zisserman
2020-05-01
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
Linjie LiYen-Chun ChenYu ChengZhe GanLicheng YuJingjing Liu
2020-05-01
A Transformer-based Approach for Source Code Summarization
| Wasi Uddin AhmadSaikat ChakrabortyBaishakhi RayKai-Wei Chang
2020-05-01
A Controllable Model of Grounded Response Generation
Zeqiu WuMichel GalleyChris BrockettYizhe ZhangXiang GaoChris QuirkRik Koncel-KedziorskiJianfeng GaoHannaneh HajishirziMari OstendorfBill Dolan
2020-05-01
Multi-scale Transformer Language Models
Sandeep SubramanianRonan CollobertMarc'Aurelio RanzatoY-Lan Boureau
2020-05-01
POINTER: Constrained Text Generation via Insertion-based Generative Pre-training
| Yizhe ZhangGuoyin WangChunyuan LiZhe GanChris BrockettBill Dolan
2020-05-01
Event Clustering within News Articles
Faik Kerem {\"O}rsS{\"u}veyda YeniterziReyyan Yeniterzi
2020-05-01
Detecting Direct Speech in Multilingual Collection of 19th-century Novels
Joanna ByszukMicha{\l} Wo{\'z}niakMike KestemontAlbert Le{\'s}niakWojciech {\L}ukasikArtjoms {\v{S}}e{\c{l}}aMaciej Eder
2020-05-01
ASU\_OPTO at OSACT4 - Offensive Language Detection for Arabic text
Amr KelegSamhaa R. El-BeltagyMahmoud Khalil
2020-05-01
Scaling Language Data Import/Export with a Data Transformer Interface
Nicholas BuckeridgeBen Foley
2020-05-01
Aggression Identification in Social Media: a Transfer Learning Based Approach
RamiFaneva risoaJosiane Mothe
2020-05-01
IRIT at TRAC 2020
RamiFaneva risoaJosiane Mothe
2020-05-01
Multilingual Joint Fine-tuning of Transformer models for identifying Trolling, Aggression and Cyberbullying at TRAC 2020
| Sudhanshu MishraShivangi PrasadShubhanshu Mishra
2020-05-01
On the Influence of Coreference Resolution on Word Embeddings in Lexical-semantic Evaluation Tasks
Alex HenleinerAlex Mehlerer
2020-05-01
Chinese Discourse Parsing: Model and Evaluation
Lin Chuan-AnShyh-Shiun HungHen-Hsen HuangHsin-Hsi Chen
2020-05-01
DecOp: A Multilingual and Multi-domain Corpus For Detecting Deception In Typed Text
Pasquale CapuozzoIvano LauriolaCarlo StrapparavaFabio AiolliGiuseppe Sartori
2020-05-01
Paraphrase Generation and Evaluation on Colloquial-Style Sentences
Eetu Sj{\"o}blomMathias CreutzYves Scherrer
2020-05-01
Building a Task-oriented Dialog System for Languages with no Training Data: the Case for Basque
Maddalen L{\'o}pez de LacalleXabier SaralegiI{\~n}aki San Vicente
2020-05-01
Linguistically Informed Hindi-English Neural Machine Translation
Vikrant GoyalPruthwik MishraDipti Misra Sharma
2020-05-01
Corpora for Document-Level Neural Machine Translation
Siyou LiuXiaojun Zhang
2020-05-01
Multilingual Corpus Creation for Multilingual Semantic Similarity Task
Mahtab AhmedChahna DixitRobert E. MercerAtif KhanMuhammad Rifayat SameeFelipe Urra
2020-05-01
Exploring Transformer Text Generation for Medical Dataset Augmentation
Ali Amin-NejadJulia IveSumithra Velupillai
2020-05-01
Much Ado About Nothing -- Identification of Zero Copulas in Hungarian Using an NMT Model
Andrea D{\"o}m{\"o}t{\"o}rZijian Gy{\H{o}}z{\H{o}} YangAttila Nov{\'a}k
2020-05-01
ParlVote: A Corpus for Sentiment Analysis of Political Debates
Gavin AbercrombieRiza Batista-Navarro
2020-05-01
Cross-lingual and Cross-domain Evaluation of Machine Reading Comprehension with Squad and CALOR-Quest Corpora
Delphine CharletGeraldine DamnatiFrederic Bechetgabriel marzinottoJohannes Heinecke
2020-05-01
Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task
| Md Tahmid Rahman LaskarJimmy Xiangji HuangEnamul Hoque
2020-05-01
Evaluation of Dataset Selection for Pre-Training and Fine-Tuning Transformer Language Models for Clinical Question Answering
Sarvesh SoniKirk Roberts
2020-05-01
Minority Positive Sampling for Switching Points - an Anecdote for the Code-Mixing Language Modeling
Arindam ChatterjereVineeth GupthaParul ChopraAmitava Das
2020-05-01
Seq2SeqPy: A Lightweight and Customizable Toolkit for Neural Sequence-to-Sequence Modeling
Raheel QaderFran{\c{c}}ois PortetCyril Labbe
2020-05-01
Transfer learning applied to text classification in Spanish radiological reports
Pilar L{\'o}pez {\'U}bedaManuel Carlos D{\'\i}az-GalianoL. Alfonso Urena LopezMaite MartinTeodoro Mart{\'\i}n-NoguerolAntonio Luna
2020-05-01
``A Passage to India'': Pre-trained Word Embeddings for Indian Languages
Saurav KumarSaunack KumarDiptesh KanojiaPushpak Bhattacharyya
2020-05-01
Evaluation Metrics for Headline Generation Using Deep Pre-Trained Embeddings
Abdul MoeedYang AnGerhard HagererGeorg Groh
2020-05-01
SiBert: Enhanced Chinese Pre-trained Language Model with Sentence Insertion
Jiahao ChenChenjie CaoXiuyan Jiang
2020-05-01
Evaluating the Impact of Sub-word Information and Cross-lingual Word Embeddings on Mi'kmaq Language Modelling
Jeremie BoudreauAkankshya PatraAshima SuvarnaPaul Cook
2020-05-01
KLEJ: Comprehensive Benchmark for Polish Language Understanding
| Piotr RybakRobert MroczkowskiJanusz TraczIreneusz Gawlik
2020-05-01
Few-Shot Learning for Opinion Summarization
Arthur BražinskasMirella LapataIvan Titov
2020-04-30
SegaBERT: Pre-training of Segment-aware BERT for Language Understanding
He BaiPeng ShiJimmy LinLuchen TanKun XiongWen GaoMing Li
2020-04-30
PlotMachines: Outline-Conditioned Generation with Dynamic Plot State Tracking
Hannah RashkinAsli CelikyilmazYejin ChoiJianfeng Gao
2020-04-30
Addressing Zero-Resource Domains Using Document-Level Context in Neural Machine Translation
Dario StojanovskiAlexander Fraser
2020-04-30
Progressive Transformers for End-to-End Sign Language Production
| Ben SaundersNecati Cihan CamgozRichard Bowden
2020-04-30
Accurate Word Alignment Induction from Neural Machine Translation
Yun ChenYang LiuGuanhua ChenXin JiangQun Liu
2020-04-30
Character-Level Translation with Self-attention
Yingqiang GaoNikola I. NikolovYuhuang HuRichard H. R. Hahnloser
2020-04-30
Semantic Triple Encoder for Fast Open-Set Link Prediction
Bo WangTao ShenGuodong LongTianyi ZhouYi Chang
2020-04-30
Self-Supervised and Controlled Multi-Document Opinion Summarization
Hady ElsaharMaximin CoavouxMatthias GalléJos Rozen
2020-04-30
End-to-End Neural Word Alignment Outperforms GIZA++
Thomas ZenkelJoern WuebkerJohn DeNero
2020-04-30
Capsule-Transformer for Neural Machine Translation
Sufeng DuanJuncheng CaoHai Zhao
2020-04-30
Breaking (Global) Barriers in Parallel Stochastic Optimization with Wait-Avoiding Group Averaging
Shigang LiTal Ben-NunGiorgi NadiradzeSalvatore Di GirolamoNikoli DrydenDan AlistarhTorsten Hoefler
2020-04-30
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation
Asa Cooper SticklandXian LiMarjan Ghazvininejad
2020-04-30
Towards Character-Level Transformer NMT by Finetuning Subword Systems
Jindřich LibovickýAlexander Fraser
2020-04-29
Efficient Document Re-Ranking for Transformers by Precomputing Term Representations
| Sean MacAvaneyFranco Maria NardiniRaffaele PeregoNicola TonellottoNazli GoharianOphir Frieder
2020-04-29
GePpeTto Carves Italian into a Language Model
| Lorenzo De MatteiMichele CafagnaFelice Dell'OrlettaMalvina NissimMarco Guerini
2020-04-29
Image Captioning through Image Transformer
Sen HeWentong LiaoHamed R. TavakoliMichael YangBodo RosenhahnNicolas Pugeault
2020-04-29
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning
Alexandre TamborrinoNicola PellicanoBaptiste PannierPascal VoitotLouise Naudin
2020-04-29
Image Morphing with Perceptual Constraints and STN Alignment
Noa FishRichard ZhangLilach PerryDaniel Cohen-OrEli ShechtmanConnelly Barnes
2020-04-29
Multiresolution and Multimodal Speech Recognition with Transformers
Georgios ParaskevopoulosSrinivas ParthasarathyAparna KhareShiva Sundaram
2020-04-29
$R^3$: Reverse, Retrieve, and Rank for Sarcasm Generation with Commonsense Knowledge
| Tuhin ChakrabartyDebanjan GhoshSmaranda MuresanNanyun Peng
2020-04-28
EARL: Speedup Transformer-based Rankers with Pre-computed Representation
Luyu GaoZhuyun DaiJamie Callan
2020-04-28
VD-BERT: A Unified Vision and Dialog Transformer with BERT
Yue WangShafiq JotyMichael R. LyuIrwin KingCaiming XiongSteven C. H. Hoi
2020-04-28
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
| Ji XinRaphael TangJaejun LeeYaoliang YuJimmy Lin
2020-04-27
LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning
Kaitao SongHao SunXu TanTao QinJianfeng LuHongzhi LiuTie-Yan Liu
2020-04-27
Augmenting Transformers with KNN-Based Composite Memory for Dialogue
Angela FanClaire GardentChloe BraudAntoine Bordes
2020-04-27
Lexically Constrained Neural Machine Translation with Levenshtein Transformer
| Raymond Hendy SusantoShamil ChollampattLiling Tan
2020-04-27
Explicitly Modeling Adaptive Depths for Transformer
Yijin LiuFandong MengJie ZhouYufeng ChenJinan Xu
2020-04-27
Assessing Discourse Relations in Language Generation from Pre-trained Language Models
Wei-Jen KoJunyi Jessy Li
2020-04-26
Experiments with LVT and FRE for Transformer model
Ilshat GibadullinAidar Valeev
2020-04-26
Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias
Jesse VigSebastian GehrmannYonatan BelinkovSharon QianDaniel NevoYaron SingerStuart Shieber
2020-04-26
Research on Modeling Units of Transformer Transducer for Mandarin Speech Recognition
Li FuXiaoxiao LiLibo Zi
2020-04-26
Choppy: Cut Transformer For Ranked List Truncation
Dara BahriYi TayChe ZhengDonald MetzlerAndrew Tomkins
2020-04-26
Combining Word Embeddings and N-grams for Unsupervised Document Summarization
Zhuolin JiangManaj SrivastavaSanjay KrishnaDavid AkodesRichard Schwartz
2020-04-25
All Word Embeddings from One Embedding
| Sho TakaseSosuke Kobayashi
2020-04-25
Lite Transformer with Long-Short Range Attention
| Zhanghao WuZhijian LiuJi LinYujun LinSong Han
2020-04-24
On Sparsifying Encoder Outputs in Sequence-to-Sequence Models
Biao ZhangIvan TitovRico Sennrich
2020-04-24
FLAT: Chinese NER Using Flat-Lattice Transformer
Xiaonan LiHang YanXipeng QiuXuanjing Huang
2020-04-24
Understanding when spatial transformer networks do not support invariance, and what to do about it
Lukas FinnvedenYlva JanssonTony Lindeberg
2020-04-24
A Tailored Pre-Training Model for Task-Oriented Dialog Generation
Jing GuQingyang WuChongruo WuWeiyan ShiZhou Yu
2020-04-24
Cross-lingual Information Retrieval with BERT
Zhuolin JiangAmro El-JaroudiWilliam HartmannDamianos KarakosLingjun Zhao
2020-04-24
UHH-LT at SemEval-2020 Task 12: Fine-Tuning of Pre-Trained Transformer Networks for Offensive Language Detection
Gregor WiedemannSeid Muhie YimamChris Biemann
2020-04-23
MolTrans: Molecular Interaction Transformer for Drug Target Interaction Prediction
Kexin HuangCao XiaoLucas GlassJimeng Sun
2020-04-23
Self-Attention Attribution: Interpreting Information Interactions Inside Transformer
Yaru HaoLi DongFuru WeiKe Xu
2020-04-23
Towards a Competitive End-to-End Speech Recognition for CHiME-6 Dinner Party Transcription
Andrei AndrusenkoAleksandr LaptevIvan Medennikov
2020-04-22
Logical Natural Language Generation from Open-Domain Tables
| Wenhu ChenJianshu ChenYu SuZhiyu ChenWilliam Yang Wang
2020-04-22
Attention is Not Only a Weight: Analyzing Transformers with Vector Norms
| Goro KobayashiTatsuki KuribayashiSho YokoiKentaro Inui
2020-04-21
Vector Quantized Contrastive Predictive Coding for Template-based Music Generation
| Gaëtan HadjeresLéopold Crestel
2020-04-21
Joint Cross-Modality Super Resolution
Guy ShachtSharon FogelDov DanonDaniel Cohen-OrIlya Leizerson
2020-04-21
DIET: Lightweight Language Understanding for Dialogue Systems
| Tanja BunkDaksh VarshneyaVladimir VlasovAlan Nichol
2020-04-21
Contextual Neural Machine Translation Improves Translation of Cataphoric Pronouns
KayYen WongSameen MarufGholamreza Haffari
2020-04-21
Keyphrase Generation with Cross-Document Attention
| Shizhe DiaoYan SongTong Zhang
2020-04-21
Mirror Ritual: An Affective Interface for Emotional Self-Reflection
Nina RajcicJon McCormack
2020-04-21
Learning Local Neighboring Structure for Robust 3D Shape Representation
| Zhongpai GaoGuangtao ZhaiJuyong ZhangJunchi YanYiyan YangXiaokang Yang
2020-04-21
A Review-based Transformer Model for Personalized Product Search
Keping BiQingyao AiW. Bruce Croft
2020-04-20
WHALETRANS: E2E WHisper to nAturaL spEech conversion using modified TRANSformer network
Abhishek NiranjanMukesh SharmaSai Bharath Chandra GuthaM Ali Basha Shaik
2020-04-20
Transformer Reasoning Network for Image-Text Matching and Retrieval
| Nicola MessinaFabrizio FalchiAndrea EsuliGiuseppe Amato
2020-04-20
StereoSet: Measuring stereotypical bias in pretrained language models
| Moin NadeemAnna BethkeSiva Reddy
2020-04-20
MPNet: Masked and Permuted Pre-training for Language Understanding
| Kaitao SongXu TanTao QinJianfeng LuTie-Yan Liu
2020-04-20
Deep-COVID: Predicting COVID-19 From Chest X-Ray Images Using Deep Transfer Learning
| Shervin MinaeeRahele KafiehMilan SonkaShakib YazdaniGhazaleh Jamalipour Soufi
2020-04-20
Motion Segmentation using Frequency Domain Transformer Networks
Hafez FaraziSven Behnke
2020-04-18
Understanding the Difficulty of Training Transformers
| Liyuan LiuXiaodong LiuJianfeng GaoWeizhu ChenJiawei Han
2020-04-17