Search Results for author: Runze Wu

Found 20 papers, 12 papers with code

A Dataset for the Validation of Truth Inference Algorithms Suitable for Online Deployment

1 code implementation • 10 Mar 2024 • Fei Wang, Haoyu Liu, Haoyang Bi, Xiangzhuang Shen, Renyu Zhu, Runze Wu, Minmin Lin, Tangjie Lv, Changjie Fan, Qi Liu, Zhenya Huang, Enhong Chen

In this paper, we introduce a substantial crowdsourcing annotation dataset collected from a real-world crowdsourcing platform.

Paper
Code

XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

no code implementations • 20 Feb 2024 • Yu Xiong, Zhipeng Hu, Ye Huang, Runze Wu, Kai Guan, Xingchen Fang, Ji Jiang, Tianze Zhou, Yujing Hu, Haoyu Liu, Tangjie Lyu, Changjie Fan

To address this, we introduce XRL-Bench, a unified standardized benchmark tailored for the evaluation and comparison of XRL methods, encompassing three main modules: standard RL environments, explainers based on state importance, and standard evaluators.

Decision Making Reinforcement Learning (RL)

Paper
Add Code

FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models

1 code implementation • 27 Nov 2023 • Ruixuan Xiao, Yiwen Dong, Junbo Zhao, Runze Wu, Minmin Lin, Gang Chen, Haobo Wang

While copious solutions, such as active learning for small language models (SLMs) and prevalent in-context learning in the era of large language models (LLMs), have been proposed and alleviate the labeling burden to some extent, their performances are still subject to human intervention.

Active Learning In-Context Learning

Paper
Code

Towards Long-term Annotators: A Supervised Label Aggregation Baseline

no code implementations • 15 Nov 2023 • Haoyu Liu, Fei Wang, Minmin Lin, Runze Wu, Renyu Zhu, Shiwei Zhao, Kai Wang, Tangjie Lv, Changjie Fan

These annotators could leave substantial historical annotation records on the crowdsourcing platforms, which can benefit label aggregation, but are ignored by previous works.

Paper
Add Code

Examining the Effect of Pre-training on Time Series Classification

no code implementations • 11 Sep 2023 • Jiashu Pu, Shiwei Zhao, Ling Cheng, Yongzhu Chang, Runze Wu, Tangjie Lv, Rongsheng Zhang

(iv) Adding more pre-training data does not improve generalization, but it can strengthen the advantage of pre-training on the original data volume, such as faster convergence.

Time Series Time Series Classification +1

Paper
Add Code

Rethinking Noisy Label Learning in Real-world Annotation Scenarios from the Noise-type Perspective

1 code implementation • 28 Jul 2023 • Renyu Zhu, Haoyu Liu, Runze Wu, Minmin Lin, Tangjie Lv, Changjie Fan, Haobo Wang

In this paper, we investigate the problem of learning with noisy labels in real-world annotation scenarios, where noise can be categorized into two types: factual noise and ambiguity noise.

Learning with noisy labels

Paper
Code

AutoMLP: Automated MLP for Sequential Recommendations

no code implementations • 11 Mar 2023 • Muyang Li, Zijian Zhang, Xiangyu Zhao, Wanyu Wang, Minghao Zhao, Runze Wu, Ruocheng Guo

Sequential recommender systems aim to predict users' next interested item given their historical interactions.

Recommendation Systems

Paper
Add Code

Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning

1 code implementation • 14 Feb 2023 • Shanqi Liu, Yujing Hu, Runze Wu, Dong Xing, Yu Xiong, Changjie Fan, Kun Kuang, Yong liu

We first illustrate that the proposed value decomposition can consider the complicated interactions among agents and is feasible to learn in large-scale scenarios.

Multi-agent Reinforcement Learning

Paper
Code

TCFimt: Temporal Counterfactual Forecasting from Individual Multiple Treatment Perspective

no code implementations • 17 Dec 2022 • Pengfei Xi, Guifeng Wang, Zhipeng Hu, Yu Xiong, Mingming Gong, Wei Huang, Runze Wu, Yu Ding, Tangjie Lv, Changjie Fan, Xiangnan Feng

TCFimt constructs adversarial tasks in a seq2seq framework to alleviate selection and time-varying bias and designs a contrastive learning-based block to decouple a mixed treatment effect into separated main treatment effects and causal interactions which further improves estimation accuracy.

Contrastive Learning counterfactual +3

Paper
Add Code

ProMix: Combating Label Noise via Maximizing Clean Sample Utility

1 code implementation • 21 Jul 2022 • Ruixuan Xiao, Yiwen Dong, Haobo Wang, Lei Feng, Runze Wu, Gang Chen, Junbo Zhao

To overcome the potential side effect of excessive clean set selection procedure, we further devise a novel SSL framework that is able to train balanced and unbiased classifiers on the separated clean and noisy samples.

Ranked #1 on Learning with noisy labels on CIFAR-10N-Worst

Learning with noisy labels

Paper
Code

Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering

1 code implementation • 26 Apr 2022 • Minghao Zhao, Le Wu, Yile Liang, Lei Chen, Jian Zhang, Qilin Deng, Kai Wang, Xudong Shen, Tangjie Lv, Runze Wu

While conventional CF models are known for facing the challenges of the popularity bias that favors popular items, one may wonder "Whether the existing graph-based CF models alleviate or exacerbate popularity bias of recommender systems?"

Collaborative Filtering Recommendation Systems

Paper
Code

MLP4Rec: A Pure MLP Architecture for Sequential Recommendations

no code implementations • 25 Apr 2022 • Muyang Li, Xiangyu Zhao, Chuan Lyu, Minghao Zhao, Runze Wu, Ruocheng Guo

In addition, most existing works assume that such sequential dependencies exist solely in the item embeddings, but neglect their existence among the item features.

Recommendation Systems

Paper
Add Code

Unsupervised Representation Learning of Player Behavioral Data with Confidence Guided Masking

1 code implementation • WWW 2022 • Jiashu Pu, Jianshi Lin, Xiaoxi Mao, Jianrong Tao, Xudong Shen, Yue Shang, Runze Wu

Players of online games generate rich behavioral data during gaming.

Feature Engineering Language Modelling +2

Paper
Code

Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection

1 code implementation • 1 Mar 2022 • Yufei Liang, Jiangning Zhang, Shiwei Zhao, Runze Wu, Yong liu, Shuwen Pan

Density-based and classification-based methods have ruled unsupervised anomaly detection in recent years, while reconstruction-based methods are rarely mentioned for the poor reconstruction ability and low performance.

Ranked #39 on Anomaly Detection on MVTec AD

Unsupervised Anomaly Detection

Paper
Code

RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System

1 code implementation • 18 Oct 2021 • Kai Wang, Zhene Zou, Minghao Zhao, Qilin Deng, Yue Shang, Yile Liang, Runze Wu, Xudong Shen, Tangjie Lyu, Changjie Fan

In summary, the RL4RS (Reinforcement Learning for Recommender Systems), a new resource with special concerns on the reality gaps, contains two real-world datasets, data understanding tools, tuned simulation environments, related advanced RL baselines, batch RL baselines, and counterfactual policy evaluation algorithms.

Combinatorial Optimization counterfactual +3

210

Paper
Code

Auto IV: Counterfactual Prediction via Automatic Instrumental Variable Decomposition

1 code implementation • 13 Jul 2021 • Junkun Yuan, Anpeng Wu, Kun Kuang, Bo Li, Runze Wu, Fei Wu, Lanfen Lin

We also learn confounder representations by encouraging them to be relevant to both the treatment and the outcome.

Causal Inference counterfactual +1

Paper
Code

Personalized Bundle Recommendation in Online Games

no code implementations • 12 Apr 2021 • Qilin Deng, Kai Wang, Minghao Zhao, Zhene Zou, Runze Wu, Jianrong Tao, Changjie Fan, Liang Chen

In business domains, \textit{bundling} is one of the most important marketing strategies to conduct product promotions, which is commonly used in online e-commerce and offline retailers.

Link Prediction Marketing +1

Paper
Add Code

Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation

no code implementations • 7 Apr 2021 • Kai Wang, Zhene Zou, Qilin Deng, Runze Wu, Jianrong Tao, Changjie Fan, Liang Chen, Peng Cui

As a part of the value function, free from the sparse and high-variance reward signals, a high-capacity reward-independent world model is trained to simulate complex environmental dynamics under a certain goal.

Model-based Reinforcement Learning Recommendation Systems +2

Paper
Add Code

OptMatch: Optimized Matchmaking via Modeling the High-Order Interactions on the Arena

1 code implementation • SIGKDD International Conference on Knowledge Discovery & Data Mining 2020 • Linxia Gong, Xiaochuan Feng, Dezhi Ye, Hao Li, Runze Wu, Jianrong Tao, Changjie Fan, Peng Cui

OptMatch contains an offline learning stage and an online planning stage.

Paper
Code

Learning Decomposed Representation for Counterfactual Inference

1 code implementation • 12 Jun 2020 • Anpeng Wu, Kun Kuang, Junkun Yuan, Bo Li, Runze Wu, Qiang Zhu, Yueting Zhuang, Fei Wu

The fundamental problem in treatment effect estimation from observational data is confounder identification and balancing.

counterfactual Counterfactual Inference

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.