Search Results for author: Quzhe Huang

Found 16 papers, 12 papers with code

Harder Tasks Need More Experts: Dynamic Routing in MoE Models

1 code implementation • 12 Mar 2024 • Quzhe Huang, Zhenwei An, Nan Zhuang, Mingxu Tao, Chen Zhang, Yang Jin, Kun Xu, Liwei Chen, Songfang Huang, Yansong Feng

In this paper, we introduce a novel dynamic expert selection framework for Mixture of Experts (MoE) models, aiming to enhance computational efficiency and model performance by adjusting the number of activated experts based on input difficulty.

Computational Efficiency

Paper
Code

Probing Multimodal Large Language Models for Global and Local Semantic Representations

1 code implementation • 27 Feb 2024 • Mingxu Tao, Quzhe Huang, Kun Xu, Liwei Chen, Yansong Feng, Dongyan Zhao

The advancement of Multimodal Large Language Models (MLLMs) has greatly accelerated the development of applications in understanding integrated texts and images.

object-detection Object Detection +1

Paper
Code

Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization

1 code implementation • 5 Feb 2024 • Yang Jin, Zhicheng Sun, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang song, Kun Gai, Yadong Mu

In light of recent advances in multimodal Large Language Models (LLMs), there is increasing attention to scaling them from image-text data to more informative real-world videos.

Ranked #63 on Visual Question Answering on MM-Vet

Video Understanding Visual Question Answering

354

Paper
Code

Relation-Aware Question Answering for Heterogeneous Knowledge Graphs

1 code implementation • 19 Dec 2023 • Haowei Du, Quzhe Huang, Chen Li, Chen Zhang, Yang Li, Dongyan Zhao

To address this issue, we construct a \textbf{dual relation graph} where each node denotes a relation in the original KG (\textbf{primal entity graph}) and edges are constructed between relations sharing same head or tail entities.

Knowledge Base Question Answering Knowledge Graphs +1

Paper
Code

MC^2: A Multilingual Corpus of Minority Languages in China

1 code implementation • 14 Nov 2023 • Chen Zhang, Mingxu Tao, Quzhe Huang, Jiuheng Lin, Zhibin Chen, Yansong Feng

However, existing LLMs exhibit limited abilities in understanding low-resource languages, including the minority languages in China, due to a lack of training data.

Paper
Code

From Simple to Complex: A Progressive Framework for Document-level Informative Argument Extraction

1 code implementation • 25 Oct 2023 • Quzhe Huang, Yanxi Zhang, Dongyan Zhao

These methods extract events according to their appearance order in the document, however, the event that appears in the first sentence does not mean that it is the easiest to extract.

Event Argument Extraction Sentence

Paper
Code

Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

1 code implementation • 9 Sep 2023 • Yang Jin, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu

Specifically, we introduce a well-designed visual tokenizer to translate the non-linguistic image into a sequence of discrete tokens like a foreign language that LLM can read.

Language Modelling Large Language Model +1

354

Paper
Code

More than Classification: A Unified Framework for Event Temporal Relation Extraction

no code implementations • 28 May 2023 • Quzhe Huang, Yutong Hu, Shengqi Zhu, Yansong Feng, Chang Liu, Dongyan Zhao

After examining the relation definitions in various ETRE tasks, we observe that all relations can be interpreted using the start and end time points of events.

Multi-Label Classification Relation +1

Paper
Add Code

Lawyer LLaMA Technical Report

1 code implementation • 24 May 2023 • Quzhe Huang, Mingxu Tao, Chen Zhang, Zhenwei An, Cong Jiang, Zhibin Chen, Zirui Wu, Yansong Feng

Specifically, we inject domain knowledge during the continual training stage and teach the model to learn professional skills using properly designed supervised fine-tuning tasks.

Hallucination Retrieval

754

Paper
Code

Do Charge Prediction Models Learn Legal Theory?

1 code implementation • 31 Oct 2022 • Zhenwei An, Quzhe Huang, Cong Jiang, Yansong Feng, Dongyan Zhao

The charge prediction task aims to predict the charge for a case given its fact description.

Paper
Code

Knowledge-enhanced Iterative Instruction Generation and Reasoning for Knowledge Base Question Answering

no code implementations • 7 Sep 2022 • Haowei Du, Quzhe Huang, Chen Zhang, Dongyan Zhao

Multi-hop Knowledge Base Question Answering(KBQA) aims to find the answer entity in a knowledge base which is several hops from the topic entity mentioned in the question.

Knowledge Base Question Answering Retrieval

Paper
Add Code

Does Recommend-Revise Produce Reliable Annotations? An Analysis on Missing Instances in DocRED

1 code implementation • ACL 2022 • Quzhe Huang, Shibo Hao, Yuan Ye, Shengqi Zhu, Yansong Feng, Dongyan Zhao

DocRED is a widely used dataset for document-level relation extraction.

Document-level Relation Extraction Relation

Paper
Code

Exploring Distantly-Labeled Rationales in Neural Network Models

no code implementations • ACL 2021 • Quzhe Huang, Shengqi Zhu, Yansong Feng, Dongyan Zhao

Recent studies strive to incorporate various human rationales into neural networks to improve model performance, but few pay attention to the quality of the rationales.

Paper
Add Code

Three Sentences Are All You Need: Local Path Enhanced Document Relation Extraction

1 code implementation • ACL 2021 • Quzhe Huang, Shengqi Zhu, Yansong Feng, Yuan Ye, Yuxuan Lai, Dongyan Zhao

Document-level Relation Extraction (RE) is a more challenging task than sentence RE as it often requires reasoning over multiple sentences.

Ranked #48 on Relation Extraction on DocRED

Document-level Relation Extraction Relation +1

Paper
Code

Why Machine Reading Comprehension Models Learn Shortcuts?

1 code implementation • Findings (ACL) 2021 • Yuxuan Lai, Chen Zhang, Yansong Feng, Quzhe Huang, Dongyan Zhao

A thorough empirical analysis shows that MRC models tend to learn shortcut questions earlier than challenging questions, and the high proportions of shortcut questions in training sets hinder models from exploring the sophisticated reasoning skills in the later stage of training.

Machine Reading Comprehension

Paper
Code

Towards Context-Aware Code Comment Generation

no code implementations • Findings of the Association for Computational Linguistics 2020 • Xiaohan Yu, Quzhe Huang, Zheng Wang, Yansong Feng, Dongyan Zhao

Code comments are vital for software maintenance and comprehension, but many software projects suffer from the lack of meaningful and up-to-date comments in practice.

Code Comment Generation Comment Generation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.