Search Results for author: Qinglin Zhang

Found 17 papers, 6 papers with code

Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

no code implementations • 11 Jan 2024 • Tianyu Cui, Yanling Wang, Chuanpu Fu, Yong Xiao, Sijia Li, Xinhao Deng, Yunpeng Liu, Qinglin Zhang, Ziyi Qiu, Peiyang Li, Zhixing Tan, Junwu Xiong, Xinyu Kong, Zujie Wen, Ke Xu, Qi Li

Based on this, we propose a comprehensive taxonomy, which systematically analyzes potential risks associated with each module of an LLM system and discusses the corresponding mitigation strategies.

Language Modelling Large Language Model

Paper
Add Code

Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token-based ASR

1 code implementation • 8 Nov 2023 • Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Shiliang Zhang, Chong Deng, Yukun Ma, Hai Yu, Jiaqing Liu, Chong Zhang

We find that applying the conventional cross-entropy loss on input speech tokens does not consistently improve the ASR performance over the Loss Masking approach.

Decoder

Paper
Code

Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling

1 code implementation • 18 Oct 2023 • Hai Yu, Chong Deng, Qinglin Zhang, Jiaqing Liu, Qian Chen, Wen Wang

Our approach improve $F_1$ of old SOTA by 3. 42 (73. 74 -> 77. 16) and reduces $P_k$ by 1. 11 points (15. 0 -> 13. 89) on WIKI-727K and achieves an average relative reduction of 4. 3% on $P_k$ on WikiSection.

Information Retrieval Segmentation +3

Paper
Code

Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation

no code implementations • 19 Sep 2023 • Luyao Cheng, Siqi Zheng, Qinglin Zhang, Hui Wang, Yafeng Chen, Qian Chen, Shiliang Zhang

Speaker diarization has gained considerable attention within speech processing research community.

speaker-diarization Speaker Diarization +1

Paper
Add Code

Improving BERT with Hybrid Pooling Network and Drop Mask

no code implementations • 14 Jul 2023 • Qian Chen, Wen Wang, Qinglin Zhang, Chong Deng, Ma Yukun, Siqi Zheng

Transformer-based pre-trained language models, such as BERT, achieve great success in various natural language understanding tasks.

Language Modelling Masked Language Modeling +2

Paper
Add Code

Advancing Precise Outline-Conditioned Text Generation with Task Duality and Explicit Outline Control

no code implementations • 23 May 2023 • Yunzhe Li, Qian Chen, Weixiang Yan, Wen Wang, Qinglin Zhang, Hari Sundaram

Furthermore, we identify an issue of imbalanced utilization of the outline information in the precise outline-conditioned generation, which is ubiquitously observed across fine-tuned models and zero-shot inference models.

Sentence Text Generation

Paper
Add Code

Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

1 code implementation • 18 May 2023 • Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Chong Deng, Hai Yu, Jiaqing Liu, Yukun Ma, Chong Zhang

Prior studies diagnose the anisotropy problem in sentence representations from pre-trained language models, e. g., BERT, without fine-tuning.

Language Modelling Semantic Textual Similarity +4

Paper
Code

Meeting Action Item Detection with Regularized Context Modeling

no code implementations • 27 Mar 2023 • Jiaqing Liu, Chong Deng, Qinglin Zhang, Qian Chen, Wen Wang

We construct and release the first Chinese meeting corpus with manual action item annotations.

Contrastive Learning

Paper
Add Code

Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)

no code implementations • 24 Mar 2023 • Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao

ICASSP2023 General Meeting Understanding and Generation Challenge (MUG) focuses on prompting a wide range of spoken language processing (SLP) research on meeting transcripts, as SLP applications are critical to improve users' efficiency in grasping important information in meetings.

Extractive Summarization Keyphrase Extraction

Paper
Add Code

MUG: A General Meeting Understanding and Generation Benchmark

1 code implementation • 24 Mar 2023 • Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao

To prompt SLP advancement, we establish a large-scale general Meeting Understanding and Generation Benchmark (MUG) to benchmark the performance of a wide range of SLP tasks, including topic segmentation, topic-level and session-level extractive summarization and topic title generation, keyphrase extraction, and action item detection.

Extractive Summarization Keyphrase Extraction +1

Paper
Code

DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect

no code implementations • 14 Dec 2022 • Jinglin Liu, Zhenhui Ye, Qian Chen, Siqi Zheng, Wen Wang, Qinglin Zhang, Zhou Zhao

Recently, binaural audio synthesis (BAS) has emerged as a promising research field for its applications in augmented and virtual realities.

Audio Synthesis

Paper
Add Code

PoNet: Pooling Network for Efficient Token Mixing in Long Sequences

1 code implementation • ICLR 2022 • Chao-Hong Tan, Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Zhen-Hua Ling

We propose a novel Pooling Network (PoNet) for token mixing in long sequences with linear complexity.

Transfer Learning

Paper
Code

Sequence Model with Self-Adaptive Sliding Window for Efficient Spoken Document Segmentation

1 code implementation • 20 Jul 2021 • Qinglin Zhang, Qian Chen, YaLi Li, Jiaqing Liu, Wen Wang

Evaluations are conducted on the English Wiki-727K document segmentation benchmark, a Chinese Wikipedia-based document segmentation dataset we created, and an in-house Chinese spoken document dataset.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Code

Pre-training for Spoken Language Understanding with Joint Textual and Phonetic Representation Learning

no code implementations • 21 Apr 2021 • Qian Chen, Wen Wang, Qinglin Zhang

In this paper, we propose a novel joint textual-phonetic pre-training approach for learning spoken language representations, aiming at exploring the full potentials of phonetic information to improve SLU robustness to ASR errors.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

Paper
Add Code

Discriminative Self-training for Punctuation Prediction

no code implementations • 21 Apr 2021 • Qian Chen, Wen Wang, Mengzhe Chen, Qinglin Zhang

Punctuation prediction for automatic speech recognition (ASR) output transcripts plays a crucial role for improving the readability of the ASR transcripts and for improving the performance of downstream natural language processing applications.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

An Application of ASP Theories of Intentions to Understanding Restaurant Scenarios: Insights and Narrative Corpus

no code implementations • 30 Sep 2018 • Qinglin Zhang, Chris Benton, Daniela Inclezan

This paper presents a practical application of Answer Set Programming to the understanding of narratives about restaurants.

Paper
Add Code

An ASP Methodology for Understanding Narratives about Stereotypical Activities

no code implementations • 26 Apr 2018 • Daniela Inclezan, Qinglin Zhang, Marcello Balduccini, Ankush Israney

We exemplify the application of this methodology by answering questions about a number of restaurant stories.

Question Answering

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.