Search Results for author: Zhengxin Zhang

Found 7 papers, 3 papers with code

Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models

1 code implementation • 13 Jan 2024 • Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Qing Li, Yong Jiang, Zhihao Jia

Experiments show that QST can reduce the total memory footprint by up to 2. 3 $\times$ and speed up the finetuning process by up to 3 $\times$ while achieving competent performance compared with the state-of-the-art.

6

Paper
Code

SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification

3 code implementations • 16 May 2023 • Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Zhengxin Zhang, Rae Ying Yee Wong, Alan Zhu, Lijie Yang, Xiaoxiang Shi, Chunan Shi, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia

Our evaluation shows that SpecInfer outperforms existing LLM serving systems by 1. 5-2. 8x for distributed LLM inference and by 2. 6-3. 5x for offloading-based LLM inference, while preserving the same generative performance.

Language Modelling Large Language Model

1,516

Paper
Code

Cycle Consistent Probability Divergences Across Different Spaces

no code implementations • 22 Nov 2021 • Zhengxin Zhang, Youssef Mroueh, Ziv Goldfeld, Bharath K. Sriperumbudur

Discrepancy measures between probability distributions are at the core of statistical inference and machine learning.

Computational Efficiency Generative Adversarial Network

Paper
Add Code

Non-Asymptotic Performance Guarantees for Neural Estimation of $\mathsf{f}$-Divergences

no code implementations • 11 Mar 2021 • Sreejith Sreekumar, Zhengxin Zhang, Ziv Goldfeld

Statistical distances (SDs), which quantify the dissimilarity between probability distributions, are central to machine learning and statistics.

Paper
Add Code

ZQM at SemEval-2019 Task9: A Single Layer CNN Based on Pre-trained Model for Suggestion Mining

no code implementations • SEMEVAL 2019 • Qimin Zhou, Zhengxin Zhang, Hao Wu, Linmao Wang

In our system, the input of convolutional neural network is the embedding vectors which are drawn from the pre-trained BERT model.

Position Suggestion mining

Paper
Add Code

NLPZZX at SemEval-2018 Task 1: Using Ensemble Method for Emotion and Sentiment Intensity Determination

no code implementations • SEMEVAL 2018 • Zhengxin Zhang, Qimin Zhou, Hao Wu

We participate in two subtasks for English tweets: EI-reg and V-reg.

Sentiment Analysis

Paper
Add Code

Road Extraction by Deep Residual U-Net

13 code implementations • 29 Nov 2017 • Zhengxin Zhang, Qingjie Liu, Yunhong Wang

Road extraction from aerial images has been a hot research topic in the field of remote sensing image analysis.

Ranked #2 on Skin Cancer Segmentation on Kaggle Skin Lesion Segmentation

Lesion Segmentation Lung Nodule Segmentation +2

413

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.