Search Results for author: Wenlin Wang

Found 30 papers, 7 papers with code

JIZHI: A Fast and Cost-Effective Model-As-A-Service System for Web-Scale Online Inference at Baidu

1 code implementation • 3 Jun 2021 • Hao liu, Qian Gao, Jiang Li, Xiaochao Liao, Hao Xiong, Guangxing Chen, Wenlin Wang, Guobao Yang, Zhiwei Zha, daxiang dong, Dejing Dou, Haoyi Xiong

In this work, we present JIZHI - a Model-as-a-Service system - that per second handles hundreds of millions of online inference requests to huge deep models with more than trillions of sparse parameters, for over twenty real-time recommendation services at Baidu, Inc.

Recommendation Systems

878

Paper
Code

Improving Text Generation with Student-Forcing Optimal Transport

no code implementations • EMNLP 2020 • Guoyin Wang, Chunyuan Li, Jianqiao Li, Hao Fu, Yuh-Chen Lin, Liqun Chen, Yizhe Zhang, Chenyang Tao, Ruiyi Zhang, Wenlin Wang, Dinghan Shen, Qian Yang, Lawrence Carin

An extension is further proposed to improve the OT learning, based on the structural and contextual information of the text sequences.

Machine Translation Text Generation +2

Paper
Add Code

Improving Adversarial Text Generation by Modeling the Distant Future

no code implementations • ACL 2020 • Ruiyi Zhang, Changyou Chen, Zhe Gan, Wenlin Wang, Dinghan Shen, Guoyin Wang, Zheng Wen, Lawrence Carin

Auto-regressive text generation models usually focus on local fluency, and may cause inconsistent semantic meaning in long text generation.

Adversarial Text Imitation Learning +1

Paper
Add Code

Nested-Wasserstein Self-Imitation Learning for Sequence Generation

no code implementations • 20 Jan 2020 • Ruiyi Zhang, Changyou Chen, Zhe Gan, Zheng Wen, Wenlin Wang, Lawrence Carin

Reinforcement learning (RL) has been widely studied for improving sequence-generation models.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Graph-Driven Generative Models for Heterogeneous Multi-Task Learning

no code implementations • 20 Nov 2019 • Wenlin Wang, Hongteng Xu, Zhe Gan, Bai Li, Guoyin Wang, Liqun Chen, Qian Yang, Wenqi Wang, Lawrence Carin

We propose a novel graph-driven generative model, that unifies multiple heterogeneous learning tasks into the same framework.

Multi-Task Learning Type prediction

Paper
Add Code

An End-to-End Generative Architecture for Paraphrase Generation

no code implementations • IJCNLP 2019 • Qian Yang, Zhouyuan Huo, Dinghan Shen, Yong Cheng, Wenlin Wang, Guoyin Wang, Lawrence Carin

Generating high-quality paraphrases is a fundamental yet challenging natural language processing task.

Paraphrase Generation

Paper
Add Code

Learning to Recommend from Sparse Data via Generative User Feedback

no code implementations • ICLR 2020 • Wenlin Wang, Hongteng Xu, Ruiyi Zhang, Wenqi Wang, Piyush Rai, Lawrence Carin

To address this, we propose a learning framework that improves collaborative filtering with a synthetic feedback loop (CF-SFL) to simulate the user feedback.

Collaborative Filtering Recommendation Systems

Paper
Add Code

Zero-Shot Recognition via Optimal Transport

no code implementations • 20 Oct 2019 • Wenlin Wang, Hongteng Xu, Guoyin Wang, Wenqi Wang, Lawrence Carin

{Specifically, we build a conditional generative model to generate features from seen-class attributes, and establish an optimal transport between the distribution of the generated features and that of the real features.}

Attribute Generalized Zero-Shot Learning

Paper
Add Code

Improving Textual Network Learning with Variational Homophilic Embeddings

1 code implementation • NeurIPS 2019 • Wenlin Wang, Chenyang Tao, Zhe Gan, Guoyin Wang, Liqun Chen, Xinyuan Zhang, Ruiyi Zhang, Qian Yang, Ricardo Henao, Lawrence Carin

This paper considers a novel variational formulation of network embeddings, with special focus on textual networks.

Network Embedding

Paper
Code

Ouroboros: On Accelerating Training of Transformer-Based Language Models

1 code implementation • NeurIPS 2019 • Qian Yang, Zhouyuan Huo, Wenlin Wang, Heng Huang, Lawrence Carin

Model parallelism is required if a model is too large to fit in a single computing device.

Language Modelling Machine Translation +2

Paper
Code

Improving Textual Network Embedding with Global Attention via Optimal Transport

no code implementations • ACL 2019 • Liqun Chen, Guoyin Wang, Chenyang Tao, Dinghan Shen, Pengyu Cheng, Xinyuan Zhang, Wenlin Wang, Yizhe Zhang, Lawrence Carin

Constituting highly informative network embeddings is an important tool for network analysis.

Network Embedding

Paper
Add Code

Topic-Guided Variational Auto-Encoder for Text Generation

no code implementations • NAACL 2019 • Wenlin Wang, Zhe Gan, Hongteng Xu, Ruiyi Zhang, Guoyin Wang, Dinghan Shen, Changyou Chen, Lawrence Carin

We propose a topic-guided variational auto-encoder (TGVAE) model for text generation.

Conditional Text Generation

Paper
Add Code

On Norm-Agnostic Robustness of Adversarial Training

no code implementations • 15 May 2019 • Bai Li, Changyou Chen, Wenlin Wang, Lawrence Carin

Adversarial examples are carefully perturbed in-puts for fooling machine learning models.

BIG-bench Machine Learning

Paper
Add Code

Second-Order Adversarial Attack and Certifiable Robustness

no code implementations • ICLR 2019 • Bai Li, Changyou Chen, Wenlin Wang, Lawrence Carin

In this paper, we propose a powerful second-order attack method that reduces the accuracy of the defense model by Madry et al. (2017).

Adversarial Attack

Paper
Add Code

Topic-Guided Variational Autoencoders for Text Generation

no code implementations • 17 Mar 2019 • Wenlin Wang, Zhe Gan, Hongteng Xu, Ruiyi Zhang, Guoyin Wang, Dinghan Shen, Changyou Chen, Lawrence Carin

We propose a topic-guided variational autoencoder (TGVAE) model for text generation.

Conditional Text Generation

Paper
Add Code

Sequence Generation with Guider Network

no code implementations • 2 Nov 2018 • Ruiyi Zhang, Changyou Chen, Zhe Gan, Wenlin Wang, Liqun Chen, Dinghan Shen, Guoyin Wang, Lawrence Carin

Sequence generation with reinforcement learning (RL) has received significant attention recently.

Reinforcement Learning (RL)

Paper
Add Code

Distilled Wasserstein Learning for Word Embedding and Topic Modeling

no code implementations • NeurIPS 2018 • Hongteng Xu, Wenlin Wang, Wei Liu, Lawrence Carin

When learning the topic model, we leverage a distilled underlying distance matrix to update the topic distributions and smoothly calculate the corresponding optimal transports.

Mortality Prediction Word Embeddings

Paper
Add Code

Certified Adversarial Robustness with Additive Noise

3 code implementations • NeurIPS 2019 • Bai Li, Changyou Chen, Wenlin Wang, Lawrence Carin

The existence of adversarial data examples has drawn significant attention in the deep-learning community; such data are seemingly minimally perturbed relative to the original data, but lead to very different outputs from a deep-learning algorithm.

Adversarial Attack Adversarial Robustness

353

Paper
Code

A Unified Particle-Optimization Framework for Scalable Bayesian Sampling

no code implementations • 29 May 2018 • Changyou Chen, Ruiyi Zhang, Wenlin Wang, Bai Li, Liqun Chen

There has been recent interest in developing scalable Bayesian sampling methods such as stochastic gradient MCMC (SG-MCMC) and Stein variational gradient descent (SVGD) for big-data analysis.

Paper
Add Code

Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms

2 code implementations • ACL 2018 • Dinghan Shen, Guoyin Wang, Wenlin Wang, Martin Renqiang Min, Qinliang Su, Yizhe Zhang, Chunyuan Li, Ricardo Henao, Lawrence Carin

Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations.

Ranked #1 on Named Entity Recognition (NER) on CoNLL 2000

Document Classification General Classification +4

284

Paper
Code

NASH: Toward End-to-End Neural Architecture for Generative Semantic Hashing

1 code implementation • ACL 2018 • Dinghan Shen, Qinliang Su, Paidamoyo Chapfuwa, Wenlin Wang, Guoyin Wang, Lawrence Carin, Ricardo Henao

Semantic hashing has become a powerful paradigm for fast similarity search in many information retrieval systems.

Information Retrieval Retrieval +1

Paper
Code

Joint Embedding of Words and Labels for Text Classification

2 code implementations • ACL 2018 • Guoyin Wang, Chunyuan Li, Wenlin Wang, Yizhe Zhang, Dinghan Shen, Xinyuan Zhang, Ricardo Henao, Lawrence Carin

Word embeddings are effective intermediate representations for capturing semantic regularities between words, when learning the representations of text sequences.

Ranked #11 on Text Classification on DBpedia

General Classification Sentiment Analysis +2

323

Paper
Code

Wide Compression: Tensor Ring Nets

no code implementations • CVPR 2018 • Wenqi Wang, Yifan Sun, Brian Eriksson, Wenlin Wang, Vaneet Aggarwal

Deep neural networks have demonstrated state-of-the-art performance in a variety of real-world applications.

Image Classification

Paper
Add Code

On the Use of Word Embeddings Alone to Represent Natural Language Sequences

no code implementations • ICLR 2018 • Dinghan Shen, Guoyin Wang, Wenlin Wang, Martin Renqiang Min, Qinliang Su, Yizhe Zhang, Ricardo Henao, Lawrence Carin

In this paper, we conduct an extensive comparative study between Simple Word Embeddings-based Models (SWEMs), with no compositional parameters, relative to employing word embeddings within RNN/CNN-based models.

Sentence Word Embeddings

Paper
Add Code

Topic Compositional Neural Language Model

no code implementations • 28 Dec 2017 • Wenlin Wang, Zhe Gan, Wenqi Wang, Dinghan Shen, Jiaji Huang, Wei Ping, Sanjeev Satheesh, Lawrence Carin

The TCNLM learns the global semantic coherence of a document via a neural topic model, and the probability of each learned latent topic is further used to build a Mixture-of-Experts (MoE) language model, where each expert (corresponding to one topic) is a recurrent neural network (RNN) that accounts for learning the local structure of a word sequence.

Language Modelling

Paper
Add Code

InverseNet: Solving Inverse Problems with Splitting Networks

no code implementations • 1 Dec 2017 • Kai Fan, Qi Wei, Wenlin Wang, Amit Chakraborty, Katherine Heller

We propose a new method that uses deep learning techniques to solve the inverse problems.

Colorization Deblurring +2

Paper
Add Code

Zero-Shot Learning via Class-Conditioned Deep Generative Models

no code implementations • 15 Nov 2017 • Wenlin Wang, Yunchen Pu, Vinay Kumar Verma, Kai Fan, Yizhe Zhang, Changyou Chen, Piyush Rai, Lawrence Carin

We present a deep generative model for learning to predict classes not seen at training time.

Few-Shot Learning Zero-Shot Learning

Paper
Add Code

Continuous-Time Flows for Efficient Inference and Density Estimation

no code implementations • ICML 2018 • Changyou Chen, Chunyuan Li, Liqun Chen, Wenlin Wang, Yunchen Pu, Lawrence Carin

Distinct from normalizing flows and GANs, CTFs can be adopted to achieve the above two goals in one framework, with theoretical guarantees.

Density Estimation

Paper
Add Code

A Convergence Analysis for A Class of Practical Variance-Reduction Stochastic Gradient MCMC

no code implementations • 4 Sep 2017 • Changyou Chen, Wenlin Wang, Yizhe Zhang, Qinliang Su, Lawrence Carin

However, there has been little theoretical analysis of the impact of minibatch size to the algorithm's convergence rate.

Stochastic Optimization

Paper
Add Code

Earliness-Aware Deep Convolutional Networks for Early Time Series Classification

no code implementations • 14 Nov 2016 • Wenlin Wang, Changyou Chen, Wenqi Wang, Piyush Rai, Lawrence Carin

Unlike most existing methods for early classification of time series data, that are designed to solve this problem under the assumption of the availability of a good set of pre-defined (often hand-crafted) features, our framework can jointly perform feature learning (by learning a deep hierarchy of \emph{shapelets} capturing the salient characteristics in each time series), along with a dynamic truncation model to help our deep feature learning architecture focus on the early parts of each time series.

Classification Early Classification +4

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.