Search Results for author: Qixiang Fang

Found 10 papers, 4 papers with code

PATCH -- Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Mathematics Proficiency

no code implementations2 Apr 2024 Qixiang Fang, Daniel L. Oberski, Dong Nguyen

Third, we release 4 datasets to support measuring and comparing LLM proficiency in grade school mathematics and science against human populations.

Benchmarking

USE: Dynamic User Modeling with Stateful Sequence Models

no code implementations20 Mar 2024 Zhihan Zhou, Qixiang Fang, Leonardo Neves, Francesco Barbieri, Yozen Liu, Han Liu, Maarten W. Bos, Ron Dotsch

Furthermore, we introduce a novel training objective named future W-behavior prediction to transcend the limitations of next-token prediction by forecasting a broader horizon of upcoming user behaviors.

Contrastive Learning

Epicurus at SemEval-2023 Task 4: Improving Prediction of Human Values behind Arguments by Leveraging Their Definitions

1 code implementation27 Feb 2023 Christian Fang, Qixiang Fang, Dong Nguyen

We describe our experiments for SemEval-2023 Task 4 on the identification of human values behind arguments (ValueEval).

Evaluating the Construct Validity of Text Embeddings with Application to Survey Questions

1 code implementation18 Feb 2022 Qixiang Fang, Dong Nguyen, Daniel L Oberski

Our results thus highlight the necessity to examine the construct validity of text embeddings before deploying them in social science research.

Sentence valid

Assessing the Reliability of Word Embedding Gender Bias Measures

1 code implementation EMNLP 2021 Yupei Du, Qixiang Fang, Dong Nguyen

In this paper, we assess three types of reliability of word embedding gender bias measures, namely test-retest reliability, inter-rater consistency and internal consistency.

Word Embeddings

Cannot find the paper you are looking for? You can Submit a new open access paper.