Search Results for author: Runze Wu

Found 20 papers, 12 papers with code

A Dataset for the Validation of Truth Inference Algorithms Suitable for Online Deployment

1 code implementation10 Mar 2024 Fei Wang, Haoyu Liu, Haoyang Bi, Xiangzhuang Shen, Renyu Zhu, Runze Wu, Minmin Lin, Tangjie Lv, Changjie Fan, Qi Liu, Zhenya Huang, Enhong Chen

In this paper, we introduce a substantial crowdsourcing annotation dataset collected from a real-world crowdsourcing platform.

XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

no code implementations20 Feb 2024 Yu Xiong, Zhipeng Hu, Ye Huang, Runze Wu, Kai Guan, Xingchen Fang, Ji Jiang, Tianze Zhou, Yujing Hu, Haoyu Liu, Tangjie Lyu, Changjie Fan

To address this, we introduce XRL-Bench, a unified standardized benchmark tailored for the evaluation and comparison of XRL methods, encompassing three main modules: standard RL environments, explainers based on state importance, and standard evaluators.

Decision Making Reinforcement Learning (RL)

FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models

1 code implementation27 Nov 2023 Ruixuan Xiao, Yiwen Dong, Junbo Zhao, Runze Wu, Minmin Lin, Gang Chen, Haobo Wang

While copious solutions, such as active learning for small language models (SLMs) and prevalent in-context learning in the era of large language models (LLMs), have been proposed and alleviate the labeling burden to some extent, their performances are still subject to human intervention.

Active Learning In-Context Learning

Towards Long-term Annotators: A Supervised Label Aggregation Baseline

no code implementations15 Nov 2023 Haoyu Liu, Fei Wang, Minmin Lin, Runze Wu, Renyu Zhu, Shiwei Zhao, Kai Wang, Tangjie Lv, Changjie Fan

These annotators could leave substantial historical annotation records on the crowdsourcing platforms, which can benefit label aggregation, but are ignored by previous works.

Examining the Effect of Pre-training on Time Series Classification

no code implementations11 Sep 2023 Jiashu Pu, Shiwei Zhao, Ling Cheng, Yongzhu Chang, Runze Wu, Tangjie Lv, Rongsheng Zhang

(iv) Adding more pre-training data does not improve generalization, but it can strengthen the advantage of pre-training on the original data volume, such as faster convergence.

Time Series Time Series Classification +1

Rethinking Noisy Label Learning in Real-world Annotation Scenarios from the Noise-type Perspective

1 code implementation28 Jul 2023 Renyu Zhu, Haoyu Liu, Runze Wu, Minmin Lin, Tangjie Lv, Changjie Fan, Haobo Wang

In this paper, we investigate the problem of learning with noisy labels in real-world annotation scenarios, where noise can be categorized into two types: factual noise and ambiguity noise.

Learning with noisy labels

AutoMLP: Automated MLP for Sequential Recommendations

no code implementations11 Mar 2023 Muyang Li, Zijian Zhang, Xiangyu Zhao, Wanyu Wang, Minghao Zhao, Runze Wu, Ruocheng Guo

Sequential recommender systems aim to predict users' next interested item given their historical interactions.

Recommendation Systems

Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning

1 code implementation14 Feb 2023 Shanqi Liu, Yujing Hu, Runze Wu, Dong Xing, Yu Xiong, Changjie Fan, Kun Kuang, Yong liu

We first illustrate that the proposed value decomposition can consider the complicated interactions among agents and is feasible to learn in large-scale scenarios.

Multi-agent Reinforcement Learning

TCFimt: Temporal Counterfactual Forecasting from Individual Multiple Treatment Perspective

no code implementations17 Dec 2022 Pengfei Xi, Guifeng Wang, Zhipeng Hu, Yu Xiong, Mingming Gong, Wei Huang, Runze Wu, Yu Ding, Tangjie Lv, Changjie Fan, Xiangnan Feng

TCFimt constructs adversarial tasks in a seq2seq framework to alleviate selection and time-varying bias and designs a contrastive learning-based block to decouple a mixed treatment effect into separated main treatment effects and causal interactions which further improves estimation accuracy.

Contrastive Learning counterfactual +3

ProMix: Combating Label Noise via Maximizing Clean Sample Utility

1 code implementation21 Jul 2022 Ruixuan Xiao, Yiwen Dong, Haobo Wang, Lei Feng, Runze Wu, Gang Chen, Junbo Zhao

To overcome the potential side effect of excessive clean set selection procedure, we further devise a novel SSL framework that is able to train balanced and unbiased classifiers on the separated clean and noisy samples.

Learning with noisy labels

Investigating Accuracy-Novelty Performance for Graph-based Collaborative Filtering

1 code implementation26 Apr 2022 Minghao Zhao, Le Wu, Yile Liang, Lei Chen, Jian Zhang, Qilin Deng, Kai Wang, Xudong Shen, Tangjie Lv, Runze Wu

While conventional CF models are known for facing the challenges of the popularity bias that favors popular items, one may wonder "Whether the existing graph-based CF models alleviate or exacerbate popularity bias of recommender systems?"

Collaborative Filtering Recommendation Systems

MLP4Rec: A Pure MLP Architecture for Sequential Recommendations

no code implementations25 Apr 2022 Muyang Li, Xiangyu Zhao, Chuan Lyu, Minghao Zhao, Runze Wu, Ruocheng Guo

In addition, most existing works assume that such sequential dependencies exist solely in the item embeddings, but neglect their existence among the item features.

Recommendation Systems

Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection

1 code implementation1 Mar 2022 Yufei Liang, Jiangning Zhang, Shiwei Zhao, Runze Wu, Yong liu, Shuwen Pan

Density-based and classification-based methods have ruled unsupervised anomaly detection in recent years, while reconstruction-based methods are rarely mentioned for the poor reconstruction ability and low performance.

Unsupervised Anomaly Detection

RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System

1 code implementation18 Oct 2021 Kai Wang, Zhene Zou, Minghao Zhao, Qilin Deng, Yue Shang, Yile Liang, Runze Wu, Xudong Shen, Tangjie Lyu, Changjie Fan

In summary, the RL4RS (Reinforcement Learning for Recommender Systems), a new resource with special concerns on the reality gaps, contains two real-world datasets, data understanding tools, tuned simulation environments, related advanced RL baselines, batch RL baselines, and counterfactual policy evaluation algorithms.

Combinatorial Optimization counterfactual +3

Auto IV: Counterfactual Prediction via Automatic Instrumental Variable Decomposition

1 code implementation13 Jul 2021 Junkun Yuan, Anpeng Wu, Kun Kuang, Bo Li, Runze Wu, Fei Wu, Lanfen Lin

We also learn confounder representations by encouraging them to be relevant to both the treatment and the outcome.

Causal Inference counterfactual +1

Personalized Bundle Recommendation in Online Games

no code implementations12 Apr 2021 Qilin Deng, Kai Wang, Minghao Zhao, Zhene Zou, Runze Wu, Jianrong Tao, Changjie Fan, Liang Chen

In business domains, \textit{bundling} is one of the most important marketing strategies to conduct product promotions, which is commonly used in online e-commerce and offline retailers.

Link Prediction Marketing +1

Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation

no code implementations7 Apr 2021 Kai Wang, Zhene Zou, Qilin Deng, Runze Wu, Jianrong Tao, Changjie Fan, Liang Chen, Peng Cui

As a part of the value function, free from the sparse and high-variance reward signals, a high-capacity reward-independent world model is trained to simulate complex environmental dynamics under a certain goal.

Model-based Reinforcement Learning Recommendation Systems +2

Learning Decomposed Representation for Counterfactual Inference

1 code implementation12 Jun 2020 Anpeng Wu, Kun Kuang, Junkun Yuan, Bo Li, Runze Wu, Qiang Zhu, Yueting Zhuang, Fei Wu

The fundamental problem in treatment effect estimation from observational data is confounder identification and balancing.

counterfactual Counterfactual Inference

Cannot find the paper you are looking for? You can Submit a new open access paper.