Search Results for author: Linyi Yang

Found 31 papers, 20 papers with code

A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference Resolution

1 code implementation • 2 Apr 2024 • Bowen Ding, Qingkai Min, Shengkun Ma, Yingjie Li, Linyi Yang, Yue Zhang

Event coreference resolution (ECR) systems based on pre-trained language models (PLMs) have demonstrated outstanding performance in clustering coreferential events across documents.

coreference-resolution • counterfactual • +3

Detoxifying Large Language Models via Knowledge Editing

1 code implementation • 21 Mar 2024 • Mengru Wang, Ningyu Zhang, Ziwen Xu, Zekun Xi, Shumin Deng, Yunzhi Yao, Qishen Zhang, Linyi Yang, Jindong Wang, Huajun Chen

This paper investigates using knowledge editing techniques to detoxify Large Language Models (LLMs).

knowledge editing

LLMs with Chain-of-Thought Are Non-Causal Reasoners

1 code implementation • 25 Feb 2024 • Guangsheng Bao, Hongbo Zhang, Linyi Yang, Cunxiang Wang, Yue Zhang

We further examine the factors influencing the causal structure of the implied SCM, revealing that in-context learning, supervised fine-tuning, and reinforcement learning from human feedback significantly impact the causal relations.

In-Context Learning

MRKE: The Multi-hop Reasoning Evaluation of LLMs by Knowledge Edition

no code implementations • 19 Feb 2024 • Jian Wu, Linyi Yang, Manabu Okumura, Yue Zhang

Although Large Language Models (LLMs) have shown strong performance in Multi-hop Question Answering (MHQA) tasks, their true reasoning ability remains underexplored.

Multi-hop Question Answering • Question Answering

Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature

1 code implementation • 8 Oct 2023 • Guangsheng Bao, Yanbin Zhao, Zhiyang Teng, Linyi Yang, Yue Zhang

Large language models (LLMs) have shown the ability to produce fluent and cogent content, presenting both productivity opportunities and societal risks.

A Survey on Evaluation of Large Language Models

1 code implementation • 6 Jul 2023 • Yupeng Chang, Xu Wang, Jindong Wang, Yuan Wu, Linyi Yang, Kaijie Zhu, Hao Chen, Xiaoyuan Yi, Cunxiang Wang, Yidong Wang, Wei Ye, Yue Zhang, Yi Chang, Philip S. Yu, Qiang Yang, Xing Xie

Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications.

Ethics

Masked conditional variational autoencoders for chromosome straightening

no code implementations • 25 Jun 2023 • Jingxiong Li, Sunyi Zheng, Zhongyi Shui, Shichuan Zhang, Linyi Yang, Yuxuan Sun, Yunlong Zhang, Honglin Li, Yuanxin Ye, Peter M. A. van Ooijen, Kang Li, Lin Yang

This yields a non-trivial reconstruction task, allowing the model to effectively preserve chromosome banding patterns and structure details in the reconstructed results.

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization

2 code implementations • 8 Jun 2023 • Yidong Wang, Zhuohao Yu, Zhengran Zeng, Linyi Yang, Cunxiang Wang, Hao Chen, Chaoya Jiang, Rui Xie, Jindong Wang, Xing Xie, Wei Ye, Shikun Zhang, Yue Zhang

To ensure the reliability of PandaLM, we collect a diverse human-annotated test dataset, where all contexts are generated by humans and labels are aligned with human preferences.

Language Modelling • Large Language Model

Out-of-Distribution Generalization in Text Classification: Past, Present, and Future

no code implementations • 23 May 2023 • Linyi Yang, Yaoxiao Song, Xuan Ren, Chenyang Lyu, Yidong Wang, Lingqiao Liu, Jindong Wang, Jennifer Foster, Yue Zhang

Machine learning (ML) systems in natural language processing (NLP) face significant challenges in generalizing to out-of-distribution (OOD) data, where the test distribution differs from the training data distribution.

Out-of-Distribution Generalization • text-classification • +1

Deepfake Text Detection in the Wild

1 code implementation • 22 May 2023 • Yafu Li, Qintong Li, Leyang Cui, Wei Bi, Longyue Wang, Linyi Yang, Shuming Shi, Yue Zhang

In practical scenarios, the detector faces texts from various domains or LLMs without knowing their sources.

Face Swapping • Story Generation • +1

Measuring Consistency in Text-based Financial Forecasting Models

1 code implementation • 15 May 2023 • Linyi Yang, Yingpeng Ma, Yue Zhang

Using FinTrust, we show that the consistency of state-of-the-art NLP models for financial forecasting is poor.

Learning to Generalize for Cross-domain QA

1 code implementation • 14 May 2023 • Yingjie Niu, Linyi Yang, Ruihai Dong, Yue Zhang

Our method has been theoretically and empirically shown to be effective in enhancing the generalization ability of both generative and discriminative models.

Data Augmentation • Domain Generalization • +1

On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective

1 code implementation • 22 Feb 2023 • Jindong Wang, Xixu Hu, Wenxin Hou, Hao Chen, Runkai Zheng, Yidong Wang, Linyi Yang, Haojun Huang, Wei Ye, Xiubo Geng, Binxing Jiao, Yue Zhang, Xing Xie

In this paper, we conduct a thorough evaluation of the robustness of ChatGPT from the adversarial and out-of-distribution (OOD) perspective.

Adversarial Robustness • Chatbot • +1

GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective

1 code implementation • 15 Nov 2022 • Linyi Yang, Shuibai Zhang, Libo Qin, Yafu Li, Yidong Wang, Hanmeng Liu, Jindong Wang, Xing Xie, Yue Zhang

Pre-trained language models (PLMs) are known to improve the generalization performance of natural language understanding models by leveraging large amounts of data during the pre-training phase.

Natural Language Understanding • Out-of-Distribution Generalization

Pre-Training a Graph Recurrent Network for Language Representation

1 code implementation • 8 Sep 2022 • Yile Wang, Linyi Yang, Zhiyang Teng, Ming Zhou, Yue Zhang

Transformer-based pre-trained models have advanced rapidly in recent years, becoming one of the most important backbones in natural language processing.

Language Modelling • Sentence • +2

Towards Fine-grained Causal Reasoning and QA

1 code implementation • 15 Apr 2022 • Linyi Yang, Zhen Wang, Yuxiang Wu, Jie Yang, Yue Zhang

Understanding causality is key to the success of NLP applications, especially in high-stakes domains.

Question Answering • Sentence

Challenges for Open-domain Targeted Sentiment Analysis

no code implementations • 14 Apr 2022 • Yun Luo, Hongjie Cai, Linyi Yang, Yanxia Qin, Rui Xia, Yue Zhang

Since previous studies on open-domain targeted sentiment analysis are limited in domain variety and restricted to the sentence level, we propose a novel dataset of 6,013 human-labeled instances that extends coverage both to new topics of interest and to the document level.

Sentence • Sentiment Analysis

A Rationale-Centric Framework for Human-in-the-loop Machine Learning

1 code implementation • ACL 2022 • Jinghui Lu, Linyi Yang, Brian Mac Namee, Yue Zhang

We present a novel rationale-centric framework with human-in-the-loop -- Rationales-centric Double-robustness Learning (RDL) -- to boost model out-of-distribution performance in few-shot learning scenarios.

BIG-bench Machine Learning • Few-Shot Learning

NumHTML: Numeric-Oriented Hierarchical Transformer Model for Multi-task Financial Forecasting

no code implementations • 5 Jan 2022 • Linyi Yang, Jiazheng Li, Ruihai Dong, Yue Zhang, Barry Smyth

Financial forecasting has been an important and active area of machine learning research because of the challenges it presents and the potential rewards that even minor improvements in prediction accuracy may entail.

Fact Check: Analyzing Financial Events from Multilingual News Sources

no code implementations • 29 Jun 2021 • Linyi Yang, Tin Lok James Ng, Barry Smyth, Ruihai Dong

The explosion in the sheer magnitude and complexity of financial news data in recent years makes it increasingly challenging for investment analysts to extract valuable insights and perform analysis.

Clustering

Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis

1 code implementation • ACL 2021 • Linyi Yang, Jiazheng Li, Pádraig Cunningham, Yue Zhang, Barry Smyth, Ruihai Dong

While state-of-the-art NLP models have achieved excellent performance on a wide range of tasks in recent years, important questions are being raised about their robustness and their underlying sensitivity to systematic biases that may exist in their training and test data.

counterfactual • Data Augmentation • +1

Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification

no code implementations • COLING 2020 • Linyi Yang, Eoin M. Kenny, Tin Lok James Ng, Yi Yang, Barry Smyth, Ruihai Dong

Corporate mergers and acquisitions (M&A) account for billions of dollars of investment globally every year, and offer an interesting and challenging domain for artificial intelligence.

counterfactual • Explainable Artificial Intelligence (XAI) • +3
