Search Results for author: Sijie Cheng

Found 16 papers, 10 papers with code

A Self-supervised Joint Training Framework for Document Reranking

no code implementations • Findings (NAACL) 2022 • Xiaozhi Zhu, Tianyong Hao, Sijie Cheng, Fu Lee Wang, Hai Liu

Pretrained language models such as BERT have been successfully applied to a wide range of natural language processing tasks and also achieved impressive performance in document reranking tasks.

Language Modelling Passage Ranking +1

Paper
Add Code

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models

2 code implementations • 12 Mar 2024 • Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu

The virtual API server contains a caching system and API simulators which are complementary to alleviate the change in API status.

Benchmarking

4,399

Paper
Code

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

no code implementations • 28 Feb 2024 • Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan

Multimodal pretraining has emerged as an effective strategy for the trinity of goals of representation learning in autonomous robots: 1) extracting both local and global task progression information; 2) enforcing temporal consistency of visual representation; 3) capturing trajectory-level language grounding.

Contrastive Learning Decision Making +1

Paper
Add Code

DEEM: Dynamic Experienced Expert Modeling for Stance Detection

no code implementations • 23 Feb 2024 • Xiaolong Wang, Yile Wang, Sijie Cheng, Peng Li, Yang Liu

Recent work has made a preliminary attempt to use large language models (LLMs) to solve the stance detection task, showing promising results.

Stance Detection

Paper
Add Code

Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models

1 code implementation • 22 Jan 2024 • Yile Wang, Sijie Cheng, Zixin Sun, Peng Li, Yang Liu

We propose symbol-to-language (S2L), a tuning-free method that enables large language models to solve symbol-related problems with information expressed in natural language.

Property Prediction Question Answering +1

Paper
Code

EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models

1 code implementation • 27 Nov 2023 • Sijie Cheng, Zhicheng Guo, Jingwen Wu, Kechen Fang, Peng Li, Huaping Liu, Yang Liu

However, the capability of VLMs to "think" from a first-person perspective, a crucial attribute for advancing autonomous agents and robotics, remains largely unexplored.

Attribute Question Answering +1

Paper
Code

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

1 code implementation • 20 Sep 2023 • Guan Wang, Sijie Cheng, Xianyuan Zhan, Xiangang Li, Sen Song, Yang Liu

Specifically, we consider the general SFT training data, consisting of a small amount of expert data mixed with a large proportion of sub-optimal data, without any preference labels.

Ranked #22 on Code Generation on HumanEval

Arithmetic Reasoning Code Generation +1

4,972

Paper
Code

Prompt-Guided Retrieval Augmentation for Non-Knowledge-Intensive Tasks

1 code implementation • 28 May 2023 • Zhicheng Guo, Sijie Cheng, Yile Wang, Peng Li, Yang Liu

There are two main challenges to leveraging retrieval-augmented methods for NKI tasks: 1) the demand for diverse relevance score functions and 2) the dilemma between training cost and task performance.

Retrieval

Paper
Code

Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making

1 code implementation • 27 May 2023 • Xuanjie Fang, Sijie Cheng, Yang Liu, Wei Wang

Pre-trained language models (PLMs) have been widely used to underpin various downstream tasks.

Adversarial Attack Decision Making +2

Paper
Code

Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge

1 code implementation • 10 May 2023 • Jiangjie Chen, Wei Shi, Ziquan Fu, Sijie Cheng, Lei LI, Yanghua Xiao

Large language models (LLMs) have been widely studied for their ability to store and utilize positive knowledge.

Language Modelling Negation +3

Paper
Code

Unsupervised Explanation Generation via Correct Instantiations

no code implementations • 21 Nov 2022 • Sijie Cheng, Zhiyong Wu, Jiangjie Chen, Zhixing Li, Yang Liu, Lingpeng Kong

The major difficulty is finding the conflict point, where the statement contradicts our real world.

Explanation Generation

Paper
Add Code

Learning What You Need from What You Did: Product Taxonomy Expansion with User Behaviors Supervision

1 code implementation • 28 Mar 2022 • Sijie Cheng, Zhouhong Gu, Bang Liu, Rui Xie, Wei Wu, Yanghua Xiao

Specifically, i) to fully exploit user behavioral information, we extract candidate hyponymy relations that match user interests from query-click concepts; ii) to enhance the semantic information of new concepts and better detect hyponymy relations, we model concepts and relations through both user-generated content and structural information in existing taxonomies and user click logs, by leveraging Pre-trained Language Models and Graph Neural Network combined with Contrastive Learning; iii) to reduce the cost of dataset construction and overcome data skews, we construct a high-quality and balanced training dataset from existing taxonomy with no supervision.

Contrastive Learning Taxonomy Expansion

Paper
Code

Can Pre-trained Language Models Interpret Similes as Smart as Human?

1 code implementation • ACL 2022 • Qianyu He, Sijie Cheng, Zhixu Li, Rui Xie, Yanghua Xiao

In this paper, we investigate the ability of PLMs in simile interpretation by designing a novel task named Simile Property Probing, i. e., to let the PLMs infer the shared properties of similes.

Sentiment Analysis Sentiment Classification

Paper
Code

Unsupervised Editing for Counterfactual Stories

1 code implementation • 10 Dec 2021 • Jiangjie Chen, Chun Gan, Sijie Cheng, Hao Zhou, Yanghua Xiao, Lei LI

We also propose a new metric to alleviate the shortcomings of current automatic metrics and better evaluate the trade-off.

counterfactual

Paper
Code

FedGEMS: Federated Learning of Larger Server Models via Selective Knowledge Fusion

no code implementations • 21 Oct 2021 • Sijie Cheng, Jingwen Wu, Yanghua Xiao, Yang Liu

Today data is often scattered among billions of resource-constrained edge devices with security and privacy constraints.

Federated Learning Image Classification

Paper
Add Code

On Commonsense Cues in BERT for Solving Commonsense Tasks

no code implementations • Findings (ACL) 2021 • Leyang Cui, Sijie Cheng, Yu Wu, Yue Zhang

We quantitatively investigate the presence of structural commonsense cues in BERT when solving commonsense tasks, and the importance of such cues for the model prediction.

Sentiment Analysis Sentiment Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.