Search Results for author: Yu Bao

Found 21 papers, 10 papers with code

EDT: Improving Large Language Models' Generation by Entropy-based Dynamic Temperature Sampling

1 code implementation • 21 Mar 2024 • Shimao Zhang, Yu Bao, Shujian Huang

However, a fixed temperature parameter is used in most cases, which may not always be an optimal choice for balancing generation quality and diversity.
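
The idea is easy to sketch. Below is a minimal, hypothetical NumPy illustration of entropy-based dynamic temperature sampling: the temperature shrinks as the predictive entropy grows, sharpening uncertain distributions. The linear mapping and the `alpha` constant are illustrative assumptions, not EDT's exact rule.

```python
import numpy as np

def entropy(probs):
    """Shannon entropy of a probability distribution."""
    p = probs[probs > 0]
    return float(-(p * np.log(p)).sum())

def dynamic_temperature_sample(logits, base_temp=1.0, alpha=0.5,
                               rng=np.random.default_rng(0)):
    """Sample a token with a temperature adjusted by predictive entropy.

    High entropy (the model is uncertain) -> lower temperature, which
    sharpens the distribution; low entropy -> temperature stays near
    base_temp. The mapping below is an illustrative choice.
    """
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    h = entropy(probs)
    h_max = np.log(len(logits))                    # maximum possible entropy
    temp = base_temp * (1.0 - alpha * h / h_max)   # shrink temp as entropy grows
    z = logits / temp
    scaled = np.exp(z - z.max())
    scaled /= scaled.sum()
    return int(rng.choice(len(logits), p=scaled))

# Usage: a peaked distribution keeps the temperature high; a flat one sharpens.
print(dynamic_temperature_sample(np.array([4.0, 1.0, 0.5, 0.2])))
```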

DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization

no code implementations • 7 Mar 2024 • Xiangxin Zhou, Xiwei Cheng, Yuwei Yang, Yu Bao, Liang Wang, Quanquan Gu

DecompOpt presents a new generation paradigm which combines optimization with conditional diffusion models to achieve desired properties while adhering to the molecular grammar.

Drug Discovery

Binding-Adaptive Diffusion Models for Structure-Based Drug Design

1 code implementation • 15 Jan 2024 • Zhilin Huang, Ling Yang, Zaixi Zhang, Xiangxin Zhou, Yu Bao, Xiawu Zheng, Yuwei Yang, Yu Wang, Wenming Yang

Then the selected protein-ligand subcomplex is processed with SE(3)-equivariant neural networks, and transmitted back to each atom of the complex for augmenting the target-aware 3D molecule diffusion generation with binding interaction information.

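As a rough intuition for what "SE(3)-equivariant" buys here, the following is a small NumPy sketch of an EGNN-style E(3)-equivariant message-passing layer, a simplified stand-in for the paper's networks, not its actual architecture: messages depend only on rotation-invariant quantities (features and squared distances), and coordinates are updated along relative-position vectors, so rotating or translating the input complex transforms the output identically.

```python
import numpy as np

def egnn_layer(h, x, w_e, w_h, w_x):
    """One E(3)-equivariant message-passing update (EGNN-style).

    h: (n, d) invariant atom features; x: (n, 3) coordinates.
    """
    n, d = h.shape
    diff = x[:, None, :] - x[None, :, :]          # (n, n, 3) relative positions
    dist2 = (diff ** 2).sum(-1, keepdims=True)    # (n, n, 1) invariant distances
    pair = np.concatenate(
        [np.repeat(h[:, None, :], n, 1),          # sender features
         np.repeat(h[None, :, :], n, 0),          # receiver features
         dist2], axis=-1)                         # (n, n, 2d+1)
    m = np.tanh(pair @ w_e)                       # (n, n, k) edge messages
    coef = m @ w_x                                # (n, n, 1) invariant weights
    x_new = x + (diff * coef).mean(axis=1)        # equivariant coordinate update
    h_new = np.tanh(np.concatenate([h, m.sum(axis=1)], -1) @ w_h)
    return h_new, x_new

rng = np.random.default_rng(0)
n, d, k = 5, 8, 16
h, x = rng.normal(size=(n, d)), rng.normal(size=(n, 3))
w_e, w_x = rng.normal(size=(2 * d + 1, k)) * 0.1, rng.normal(size=(k, 1)) * 0.1
w_h = rng.normal(size=(d + k, d)) * 0.1
h2, x2 = egnn_layer(h, x, w_e, w_h, w_x)
```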

Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning

1 code implementation • 23 Aug 2023 • Jiasheng Ye, Zaixiang Zheng, Yu Bao, Lihua Qian, Quanquan Gu

We then reprogram pretrained masked language models into diffusion language models via diffusive adaptation, wherein task-specific finetuning and instruction finetuning are explored to unlock their versatility in solving general language tasks.

In-Context Learning • Language Modelling +1
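
A minimal sketch of the corruption side of such diffusive adaptation, assuming an absorbing-state discrete diffusion: the only change relative to BERT-style pretraining is that the mask ratio is sampled afresh for every example instead of being fixed near 15%. `MASK_ID` and the `1/t` loss weighting in the comment are illustrative assumptions.

```python
import numpy as np

MASK_ID = 0                      # hypothetical id of the [MASK] token
rng = np.random.default_rng(0)

def diffusive_corrupt(tokens):
    """Corrupt a sequence as in absorbing-state discrete diffusion.

    The mask ratio t is drawn anew per example, so the model is trained
    at every corruption level from nearly clean to fully masked.
    """
    t = rng.uniform(0.0, 1.0)                  # diffusion time == mask ratio
    mask = rng.random(tokens.shape) < t        # mask each token with prob. t
    return np.where(mask, MASK_ID, tokens), mask, t

tokens = np.array([5, 9, 2, 7, 3, 8])
x_t, mask, t = diffusive_corrupt(tokens)
# Training step (schematic): predict the original tokens at masked slots,
# e.g. loss = cross_entropy(model(x_t)[mask], tokens[mask]) / t
print(round(t, 2), x_t)
```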

Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation

no code implementations • 31 Mar 2023 • Min Liu, Yu Bao, Chengqi Zhao, Shujian Huang

Benefiting from sequence-level knowledge distillation, the Non-Autoregressive Transformer (NAT) achieves great success in neural machine translation tasks.

Knowledge Distillation • Machine Translation +1
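
For context, the sequence-level distillation the sentence refers to has a very small data-level recipe, sketched below with hypothetical `ToyTeacher`/`ToyStudent` stand-ins; the paper's selective variant (choosing which distilled pairs to keep) is not shown.

```python
class ToyTeacher:
    """Hypothetical autoregressive teacher; here just a lookup table."""
    def __init__(self, table):
        self.table = table
    def translate(self, src):
        return self.table[src]          # stands in for beam search

class ToyStudent:
    """Hypothetical NAT student that simply memorizes training pairs."""
    def __init__(self):
        self.memory = {}
    def train_step(self, src, tgt):
        self.memory[src] = tgt          # stands in for an MLE update

def sequence_level_distill(teacher, student, sources):
    # Train the student on the teacher's outputs instead of references:
    # teacher outputs are less multimodal, which eases NAT training.
    for src in sources:
        student.train_step(src, teacher.translate(src))

teacher = ToyTeacher({"guten morgen": "good morning"})
student = ToyStudent()
sequence_level_distill(teacher, student, ["guten morgen"])
print(student.memory)
```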

DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises

1 code implementation • 20 Feb 2023 • Jiasheng Ye, Zaixiang Zheng, Yu Bao, Lihua Qian, Mingxuan Wang

In this paper, we introduce DINOISER to facilitate diffusion models for sequence generation by manipulating noises.
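
A minimal sketch of the noise-manipulation idea, under the assumption of a Gaussian forward process over learned token embeddings: the training-time noise-scale sampler simply excludes the smallest scales. The log-uniform proposal and the `sigma_min` value are illustrative, not DINOISER's exact schedule.

```python
import numpy as np
rng = np.random.default_rng(0)

def sample_noise_scale(sigma_min, sigma_max=10.0):
    """Draw a training noise scale with the smallest scales excluded.

    With learned sequence embeddings, tiny noise lets the model copy its
    input instead of learning to denoise from the source condition, so
    the sampler's floor sigma_min is raised above zero.
    """
    return float(np.exp(rng.uniform(np.log(sigma_min), np.log(sigma_max))))

def corrupt(z, sigma):
    """Standard Gaussian forward process at scale sigma."""
    return z + sigma * rng.normal(size=z.shape)

z = rng.normal(size=(4, 8))           # a toy sequence of 4 token embeddings
sigma = sample_noise_scale(sigma_min=0.5)
z_noisy = corrupt(z, sigma)
print(round(sigma, 3))
```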

SENDER: SEmi-Nonlinear Deep Efficient Reconstructor for Extraction Canonical, Meta, and Sub Functional Connectivity in the Human Brain

no code implementations • 12 Sep 2022 • Wei Zhang, Yu Bao

Deep linear and nonlinear learning methods have become vital machine learning tools for investigating hierarchical features, such as functional connectivity in the human brain, via functional Magnetic Resonance Imaging signals; however, they have three major shortcomings.

DELMAR: Deep Linear Matrix Approximately Reconstruction to Extract Hierarchical Functional Connectivity in the Human Brain

no code implementations • 20 May 2022 • Wei Zhang, Yu Bao

Moreover, the theoretical analyses indicate that DELMAR converges to a unique fixed point and can even approximate the original input as accurately as DNNs.
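
To see why alternating linear reconstructions settle at a fixed point, here is a shallow NumPy analogue (ordinary two-factor alternating least squares, not DELMAR's multilayer algorithm): each update is the exact minimizer given the other factor, so the reconstruction error decreases monotonically until it stops changing.

```python
import numpy as np
rng = np.random.default_rng(0)

def als_factorize(X, rank, iters=200, tol=1e-8):
    """Alternating least-squares factorization X ~= W @ H."""
    n, m = X.shape
    W = rng.normal(size=(n, rank))
    H = rng.normal(size=(rank, m))
    prev = np.inf
    for _ in range(iters):
        H = np.linalg.lstsq(W, X, rcond=None)[0]        # best H given W
        W = np.linalg.lstsq(H.T, X.T, rcond=None)[0].T  # best W given H
        err = np.linalg.norm(X - W @ H)
        if prev - err < tol:                            # fixed point reached
            break
        prev = err
    return W, H, err

X = rng.normal(size=(20, 6)) @ rng.normal(size=(6, 30))  # exactly rank-6 data
W, H, err = als_factorize(X, rank=6)
print(f"reconstruction error: {err:.2e}")                # ~0 at the fixed point
```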

DEMAND: Deep Matrix Approximately Nonlinear Decomposition to Identify Meta, Canonical, and Sub-Spatial Pattern of functional Magnetic Resonance Imaging in the Human Brain

no code implementations • 20 May 2022 • Wei Zhang, Yu Bao

First, the proposed DEMAND employs a non-fully-connected, multilayer-stacked architecture that is easier to optimize than canonical DNNs; furthermore, thanks to this efficient architecture, training DEMAND avoids overfitting and enables the recognition of individual/minor features from a small dataset, such as data from a single individual; finally, a novel rank-estimator technique is introduced to tune all hyperparameters of DEMAND automatically.

Dictionary Learning

SADAM: Stochastic Adam, A Stochastic Operator for First-Order Gradient-based Optimizer

no code implementations • 20 May 2022 • Wei Zhang, Yu Bao

In this work, to efficiently help escape stationary and saddle points, we propose, analyze, and generalize a stochastic strategy performed as an operator for a first-order gradient descent algorithm, in order to increase the target accuracy and reduce time consumption.
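
The generic version of such an escape operator can be sketched in a few lines: take ordinary gradient steps, but inject a random perturbation whenever the gradient nearly vanishes. This is plain perturbed gradient descent with illustrative constants; SADAM's particular operator and its integration with Adam differ in detail.

```python
import numpy as np
rng = np.random.default_rng(0)

def perturbed_gd(grad_fn, x0, lr=0.1, steps=500, g_tol=1e-3, radius=0.1):
    """Gradient descent with a stochastic escape operator.

    Whenever the gradient nearly vanishes (a stationary or saddle point),
    a random perturbation is applied so the iterate can leave the flat
    region instead of stalling there.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(steps):
        g = grad_fn(x)
        if np.linalg.norm(g) < g_tol:                       # near-stationary
            x = x + rng.normal(scale=radius, size=x.shape)  # stochastic kick
        else:
            x = x - lr * g
    return x

# f(x, y) = x^2 - y^2 + y^4/4 has a saddle at the origin and minima at
# (0, +/-sqrt(2)); plain GD started at the saddle would never move.
grad = lambda p: np.array([2 * p[0], -2 * p[1] + p[1] ** 3])
print(perturbed_gd(grad, [0.0, 0.0]))   # settles near (0, +/-1.414)
```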

$\textit{latent}$-GLAT: Glancing at Latent Variables for Parallel Text Generation

1 code implementation • 5 Apr 2022 • Yu Bao, Hao Zhou, Shujian Huang, Dongqi Wang, Lihua Qian, Xinyu Dai, Jiajun Chen, Lei Li

Recently, parallel text generation has received widespread attention due to its superior generation efficiency.

Text Generation

Non-iterative Parallel Text Generation via Glancing Transformer

no code implementations • 1 Jan 2021 • Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Weinan Zhang, Yong Yu, Lei Li

Although non-autoregressive models with one-iteration generation achieve a remarkable inference speed-up, they still fall behind their autoregressive counterparts in prediction accuracy.

Language Modelling • Text Generation
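
The speed/accuracy trade-off in that sentence can be made concrete with a toy NumPy comparison, where a hypothetical `toy_decoder` stands in for the model: the autoregressive routine needs L dependent decoder calls, while the parallel routine predicts all positions in a single call; since output tokens never condition on each other, accuracy can suffer.

```python
import numpy as np
rng = np.random.default_rng(0)

V, L, d = 100, 6, 16                     # vocab size, target length, hidden size
W = rng.normal(size=(d, V))              # toy output projection

def toy_decoder(n):
    """Stand-in decoder: one random hidden state per position.

    Outputs are random, so only the call pattern matters here.
    """
    return rng.normal(size=(n, d))

def autoregressive_decode():
    # L dependent calls: position t cannot start before position t-1.
    tokens = []
    for t in range(L):
        h = toy_decoder(t + 1)[-1]       # re-run with the grown prefix
        tokens.append(int((h @ W).argmax()))
    return tokens

def parallel_decode():
    # One call: every position predicted simultaneously -- O(1) model
    # invocations, but tokens never condition on each other.
    h = toy_decoder(L)
    return [int(i) for i in (h @ W).argmax(axis=1)]

print(autoregressive_decode())
print(parallel_decode())
```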

Explicit Semantic Decomposition for Definition Generation

no code implementations • ACL 2020 • Jiahuan Li, Yu Bao, Shujian Huang, Xinyu Dai, Jiajun Chen

Definition generation, which aims to automatically generate dictionary definitions for words, has recently been proposed to assist the construction of dictionaries and help people understand unfamiliar texts.
