Search Results for author: Yijia Zhang

Found 12 papers, 4 papers with code

BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation

1 code implementation • 16 Feb 2024 • Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu

The upscaling of Large Language Models (LLMs) has yielded impressive advances in natural language processing, yet it also poses significant deployment challenges.

Knowledge Distillation • Quantization

AFPQ: Asymmetric Floating Point Quantization for LLMs

1 code implementation • 3 Nov 2023 • Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu

Large language models (LLMs) show great performance in various tasks, but face deployment challenges from limited memory capacity and bandwidth.

Quantization

A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations

1 code implementation • 31 Oct 2023 • Hui Ma, Jian Wang, Hongfei Lin, Bo Zhang, Yijia Zhang, Bo Xu

Emotion recognition in conversations (ERC), the task of recognizing the emotion of each utterance in a conversation, is crucial for building empathetic machines.

Emotion Recognition in Conversation • Multimodal Emotion Recognition

TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching

no code implementations • 29 Aug 2023 • Yun Liao, Yide Di, Hao Zhou, Kaijun Zhu, Mingyu Lu, Yijia Zhang, Qing Duan, Junhui Liu

Local feature matching remains a challenging task, primarily due to difficulties in matching sparse keypoints and low-texture regions.

Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training

no code implementations • 31 May 2023 • Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu

We find that previous gradient accumulation reduces activation memory but is incompatible with gradient memory reduction, owing to the contradiction between preserving gradients and releasing gradients.

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models

no code implementations • 21 May 2023 • Yijia Zhang, Lingran Zhao, Shijie Cao, WenQiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu

In this study, we conduct a comparative analysis of INT and FP quantization at the same bit-width, revealing that the optimal quantization format varies across layers due to the complexity and diversity of tensor distributions.

Quantization

TC-GAT: Graph Attention Network for Temporal Causality Discovery

no code implementations • 21 Apr 2023 • Xiaosong Yuan, Ke Chen, Wanli Zuo, Yijia Zhang

The present study explores the intricacies of causal relationship extraction, a vital component in the pursuit of causality knowledge.

Graph Attention

A Unified Review of Deep Learning for Automated Medical Coding

no code implementations • 8 Jan 2022 • Shaoxiong Ji, Wei Sun, Xiaobo Li, Hang Dong, Ara Taalas, Yijia Zhang, Honghan Wu, Esa Pitkänen, Pekka Marttinen

Automated medical coding, an essential task for healthcare operation and delivery, makes unstructured data manageable by predicting medical codes from clinical documents.

Exploring Semi-supervised Variational Autoencoders for Biomedical Relation Extraction

no code implementations • 18 Jan 2019 • Yijia Zhang, Zhiyong Lu

Experimental results show that our method effectively exploits the unlabeled data to improve the performance and reduce the dependence on labeled data.

Relation • Relation Extraction
