Search Results for author: Guoxin Chen

Found 4 papers, 3 papers with code

AlphaMath Almost Zero: process Supervision without process

1 code implementation6 May 2024 Guoxin Chen, Minpeng Liao, Chengxi Li, Kai Fan

We proceed to train a step-level value model designed to improve the LLM's inference process in mathematical domains.

Mathematical Reasoning

SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning

no code implementations24 Jan 2024 Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian

Moreover, existing reinforcement learning (RL) based methods overlook the structured relationships, underutilizing the potential of RL in structured reasoning.

Question Answering reinforcement-learning +1

Causality and Independence Enhancement for Biased Node Classification

1 code implementation14 Oct 2023 Guoxin Chen, Yongqing Wang, Fangda Guo, Qinglang Guo, Jiangli Shao, HuaWei Shen, Xueqi Cheng

Most existing methods that address out-of-distribution (OOD) generalization for node classification on graphs primarily focus on a specific type of data biases, such as label selection bias or structural bias.

Classification Node Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.