Search Results for author: Yun Zhu

Found 26 papers, 7 papers with code

Multitask Multilingual Model Adaptation with Featurized Low-Rank Mixtures

no code implementations27 Feb 2024 Chu-Cheng Lin, Xinyi Wang, Jonathan H. Clark, Han Lu, Yun Zhu, Chenxi Whitehouse, Hongkun Yu

By composing feature-specific parameters for each dataset, FLix can accommodate diverse dataset mixtures and generalize better to unseen datasets.

Corrective Retrieval Augmented Generation

1 code implementation29 Jan 2024 Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, Zhen-Hua Ling

Experiments on four datasets covering short- and long-form generation tasks show that CRAG can significantly improve the performance of RAG-based approaches.

Retrieval

Efficient Tuning and Inference for Large Language Models on Textual Graphs

no code implementations28 Jan 2024 Yun Zhu, Yaoke Wang, Haizhou Shi, Siliang Tang

In this paper, we propose ENGINE, a parameter- and memory-efficient fine-tuning method for textual graphs with an LLM encoder.

Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation

no code implementations14 Jan 2024 Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng

We investigate this approach under two different settings: one where the policy model is smaller and is paired with a more powerful critic model, and another where a single language model fulfills both roles.

Language Modelling reinforcement-learning +2

SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection

1 code implementation21 Dec 2023 Yun Zhu, Le Hui, Yaqi Shen, Jin Xie

To this end, we propose a novel superpoint grouping network for indoor anchor-free one-stage 3D object detection.

3D Object Detection object-detection

Fusion-Eval: Integrating Evaluators with LLMs

no code implementations15 Nov 2023 Lei Shu, Nevan Wichers, Liangchen Luo, Yun Zhu, Yinxiao Liu, Jindong Chen, Lei Meng

Evaluating natural language systems poses significant challenges, particularly in the realms of natural language understanding and high-level reasoning.

Natural Language Understanding

RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs

1 code implementation25 Oct 2023 Bowen Tan, Yun Zhu, Lijuan Liu, Hongyi Wang, Yonghao Zhuang, Jindong Chen, Eric Xing, Zhiting Hu

In this work, we present RedCoast(Redco), a lightweight and user-friendly tool crafted to automate distributed training and inference for LLMs, as well as to simplify ML pipeline development.

Language Modelling Meta-Learning

GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning

no code implementations11 Oct 2023 Yun Zhu, Yaoke Wang, Haizhou Shi, Zhenshuo Zhang, Dian Jiao, Siliang Tang

These pre-trained models can be applied to various downstream Web applications, saving training time and improving downstream (target) performance.

Attribute Specificity +1

Critique Ability of Large Language Models

no code implementations7 Oct 2023 Liangchen Luo, Zi Lin, Yinxiao Liu, Lei Shu, Yun Zhu, Jingbo Shang, Lei Meng

In the era of large language models (LLMs), this study explores the ability of LLMs to deliver accurate critiques across various tasks.

Code Completion Decision Making +3

MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning

1 code implementation24 Jul 2023 Yun Zhu, Haizhou Shi, Zhenshuo Zhang, Siliang Tang

In this work, we investigate the problem of out-of-distribution (OOD) generalization for unsupervised learning methods on graph data.

Contrastive Learning Data Augmentation

Analysis and tuning of a three-term DMC

no code implementations25 May 2023 Yun Zhu, Kangkang Zhang, Yuncai Zhu, Jinming Zhou

Most MPC (Model Predictive Control) algorithms used in industries and studied in the control academia use a two-term QP (quadratic programming), where the first term is the weighted norm of the output errors, and the second term is that of the input increments.

Model Predictive Control

RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting

1 code implementation25 May 2023 Lei Shu, Liangchen Luo, Jayakumar Hoskere, Yun Zhu, Yinxiao Liu, Simon Tong, Jindong Chen, Lei Meng

In this work, we develop new strategies for instruction tuning and reinforcement learning to better align LLMs for cross-sentence rewriting tasks using diverse wording and structures expressed through natural languages including 1) generating rewriting instruction data from Wiki edits and public corpus through instruction generation and chain-of-thought prompting; 2) collecting comparison data for reward model training through a new ranking function.

Language Modelling Large Language Model +3

SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference

no code implementations16 Mar 2023 Boren Hu, Yun Zhu, Jiacheng Li, Siliang Tang

In this paper, we propose a novel dynamic early exiting combined with layer skipping for BERT inference named SmartBERT, which adds a skipping gate and an exiting operator into each layer of BERT.

Contrastive Learning Language Modelling +2

Structure-Aware Group Discrimination with Adaptive-View Graph Encoder: A Fast Graph Contrastive Learning Framework

no code implementations9 Mar 2023 Zhenshuo Zhang, Yun Zhu, Haizhou Shi, Siliang Tang

Albeit having gained significant progress lately, large-scale graph representation learning remains expensive to train and deploy for two main reasons: (i) the repetitive computation of multi-hop message passing and non-linearity in graph neural networks (GNNs); (ii) the computational cost of complex pairwise contrastive learning loss.

Contrastive Learning Graph Representation Learning

SGL-PT: A Strong Graph Learner with Graph Prompt Tuning

no code implementations24 Feb 2023 Yun Zhu, Jianhao Guo, Siliang Tang

And aiming for graph classification task, we unify pre-training and fine-tuning by designing a novel verbalizer-free prompting function, which reformulates the downstream task in a similar format as pretext task.

Graph Classification Graph Learning

RoSA: A Robust Self-Aligned Framework for Node-Node Graph Contrastive Learning

1 code implementation29 Apr 2022 Yun Zhu, Jianhao Guo, Fei Wu, Siliang Tang

To the best of our awareness, RoSA is the first work focuses on the non-aligned node-node graph contrastive learning problem.

Contrastive Learning Node Classification +1

Fostering the Robustness of White-Box Deep Neural Network Watermarks by Neuron Alignment

no code implementations28 Dec 2021 Fang-Qi Li, Shi-Lin Wang, Yun Zhu

The wide application of deep learning techniques is boosting the regulation of deep learning models, especially deep neural networks (DNN), as commercial products.

Efficient Sensor Management for Multitarget Tracking in Passive Sensor Networks via Cauchy-Schwarz Divergence

no code implementations3 Nov 2020 Yun Zhu

To address this problem, we present an efficient information-theoretic approach to manage the sensors for better tracking of the unknown and time-varying number of targets.

Management

Active Learning for Product Type Ontology Enhancement in E-commerce

no code implementations19 Sep 2020 Yun Zhu, Sayyed M. Zahiri, Jiaqi Wang, Han-Yu Chen, Faizan Javed

Entity-based semantic search has been widely adopted in modern search engines to improve search accuracy by understanding users' intent.

Active Learning Vocal Bursts Type Prediction

Fast Coherent Point Drift

no code implementations11 Jun 2020 Xiang-Wei Feng, Da-Zheng Feng, Yun Zhu

After iteration begins, our method only needs to update a diagonal matrix with linear computational complexity, and perform matrix multiplication operation with time complexity approximately O(M2) in each iteration.

Semantic Similarity Strategies for Job Title Classification

no code implementations20 Sep 2016 Yun Zhu, Faizan Javed, Ozgur Ozturk

A large-scale job title classification system can power various downstream applications such as semantic search, job recommendations and labor market analytics.

Classification General Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.