Search Results for author: Sijie Cheng

Found 16 papers, 10 papers with code

A Self-supervised Joint Training Framework for Document Reranking

no code implementations Findings (NAACL) 2022 Xiaozhi Zhu, Tianyong Hao, Sijie Cheng, Fu Lee Wang, Hai Liu

Pretrained language models such as BERT have been successfully applied to a wide range of natural language processing tasks and also achieved impressive performance in document reranking tasks.

Language Modelling Passage Ranking +1

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models

2 code implementations12 Mar 2024 Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu

The virtual API server contains a caching system and API simulators which are complementary to alleviate the change in API status.

Benchmarking

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

no code implementations28 Feb 2024 Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan

Multimodal pretraining has emerged as an effective strategy for the trinity of goals of representation learning in autonomous robots: 1) extracting both local and global task progression information; 2) enforcing temporal consistency of visual representation; 3) capturing trajectory-level language grounding.

Contrastive Learning Decision Making +1

DEEM: Dynamic Experienced Expert Modeling for Stance Detection

no code implementations23 Feb 2024 Xiaolong Wang, Yile Wang, Sijie Cheng, Peng Li, Yang Liu

Recent work has made a preliminary attempt to use large language models (LLMs) to solve the stance detection task, showing promising results.

Stance Detection

Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models

1 code implementation22 Jan 2024 Yile Wang, Sijie Cheng, Zixin Sun, Peng Li, Yang Liu

We propose symbol-to-language (S2L), a tuning-free method that enables large language models to solve symbol-related problems with information expressed in natural language.

Property Prediction Question Answering +1

EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models

1 code implementation27 Nov 2023 Sijie Cheng, Zhicheng Guo, Jingwen Wu, Kechen Fang, Peng Li, Huaping Liu, Yang Liu

However, the capability of VLMs to "think" from a first-person perspective, a crucial attribute for advancing autonomous agents and robotics, remains largely unexplored.

Attribute Question Answering +1

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

1 code implementation20 Sep 2023 Guan Wang, Sijie Cheng, Xianyuan Zhan, Xiangang Li, Sen Song, Yang Liu

Specifically, we consider the general SFT training data, consisting of a small amount of expert data mixed with a large proportion of sub-optimal data, without any preference labels.

Arithmetic Reasoning Code Generation +1

Prompt-Guided Retrieval Augmentation for Non-Knowledge-Intensive Tasks

1 code implementation28 May 2023 Zhicheng Guo, Sijie Cheng, Yile Wang, Peng Li, Yang Liu

There are two main challenges to leveraging retrieval-augmented methods for NKI tasks: 1) the demand for diverse relevance score functions and 2) the dilemma between training cost and task performance.

Retrieval

Unsupervised Explanation Generation via Correct Instantiations

no code implementations21 Nov 2022 Sijie Cheng, Zhiyong Wu, Jiangjie Chen, Zhixing Li, Yang Liu, Lingpeng Kong

The major difficulty is finding the conflict point, where the statement contradicts our real world.

Explanation Generation

Learning What You Need from What You Did: Product Taxonomy Expansion with User Behaviors Supervision

1 code implementation28 Mar 2022 Sijie Cheng, Zhouhong Gu, Bang Liu, Rui Xie, Wei Wu, Yanghua Xiao

Specifically, i) to fully exploit user behavioral information, we extract candidate hyponymy relations that match user interests from query-click concepts; ii) to enhance the semantic information of new concepts and better detect hyponymy relations, we model concepts and relations through both user-generated content and structural information in existing taxonomies and user click logs, by leveraging Pre-trained Language Models and Graph Neural Network combined with Contrastive Learning; iii) to reduce the cost of dataset construction and overcome data skews, we construct a high-quality and balanced training dataset from existing taxonomy with no supervision.

Contrastive Learning Taxonomy Expansion

Can Pre-trained Language Models Interpret Similes as Smart as Human?

1 code implementation ACL 2022 Qianyu He, Sijie Cheng, Zhixu Li, Rui Xie, Yanghua Xiao

In this paper, we investigate the ability of PLMs in simile interpretation by designing a novel task named Simile Property Probing, i. e., to let the PLMs infer the shared properties of similes.

Sentiment Analysis Sentiment Classification

Unsupervised Editing for Counterfactual Stories

1 code implementation10 Dec 2021 Jiangjie Chen, Chun Gan, Sijie Cheng, Hao Zhou, Yanghua Xiao, Lei LI

We also propose a new metric to alleviate the shortcomings of current automatic metrics and better evaluate the trade-off.

counterfactual

FedGEMS: Federated Learning of Larger Server Models via Selective Knowledge Fusion

no code implementations21 Oct 2021 Sijie Cheng, Jingwen Wu, Yanghua Xiao, Yang Liu

Today data is often scattered among billions of resource-constrained edge devices with security and privacy constraints.

Federated Learning Image Classification

On Commonsense Cues in BERT for Solving Commonsense Tasks

no code implementations Findings (ACL) 2021 Leyang Cui, Sijie Cheng, Yu Wu, Yue Zhang

We quantitatively investigate the presence of structural commonsense cues in BERT when solving commonsense tasks, and the importance of such cues for the model prediction.

Sentiment Analysis Sentiment Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.