Learning Multi-Step Reasoning by Solving Arithmetic Tasks

2 Jun 2023 · Tianduo Wang, Wei Lu

Mathematical reasoning is regarded as a necessary ability for Language Models (LMs). Recent works demonstrate large LMs' impressive performance in solving math problems. The success is attributed to their Chain-of-Thought (CoT) reasoning abilities, i.e., the ability to decompose complex questions into step-by-step reasoning chains, but this ability seems to emerge only in models with abundant parameters. This work investigates how to equip relatively small LMs with multi-step reasoning capabilities. We propose to inject such abilities by continually pre-training LMs on a synthetic dataset, MsAT, which is composed of Multi-step Arithmetic Tasks. Our experiments on four math word problem datasets show the effectiveness of the proposed method in enhancing LMs' math reasoning abilities.
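To make the idea of a synthetic multi-step arithmetic task concrete, here is a minimal Python sketch. It is an illustration only and does not reproduce the MsAT format: the abstract does not specify it, so the function name `make_task`, the variable naming scheme, the operator set, and the number range are all assumptions made for this example.

```python
import operator
import random

# Hypothetical operator set; the abstract does not say which operations MsAT uses.
OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}


def make_task(num_steps, rng):
    """Build one illustrative multi-step arithmetic task.

    Returns (question, steps, answer). The question lists two known numbers,
    each step derives a new variable from two earlier ones, and the answer
    is the value of the last derived variable.
    """
    if num_steps < 1:
        raise ValueError("num_steps must be at least 1")

    # Two given quantities (assumed range 1..9, chosen only for readability).
    values = {"a": rng.randint(1, 9), "b": rng.randint(1, 9)}
    question = ", ".join(f"{name} = {value}" for name, value in values.items())

    steps = []
    for i in range(num_steps):
        # Each step may reuse any earlier variable, which forms a reasoning chain.
        left, right = rng.sample(sorted(values), 2)
        symbol = rng.choice(sorted(OPS))
        name = f"v{i}"
        values[name] = OPS[symbol](values[left], values[right])
        steps.append(f"{name} = {left} {symbol} {right}")

    return question, steps, values[f"v{num_steps - 1}"]


if __name__ == "__main__":
    question, steps, answer = make_task(3, random.Random(0))
    print("Question:", question)
    for step in steps:
        print("Step:", step)
    print("Answer:", answer)
```

In a setup like the one the abstract describes, pairs of question and step sequence of this kind would serve as pre-training examples before fine-tuning on math word problems; how the actual dataset encodes its inputs and targets should be checked against the official TianduoWang/MsAT repository.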

Code

TianduoWang/MsAT official

Tasks

Math

Mathematical Reasoning

Math Word Problem Solving

Datasets

MATH

SVAMP ASDiv MAWPS

Results from the Paper

Ranked #2 on Math Word Problem Solving on MAWPS

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Math Word Problem Solving	MAWPS	MsAT-DeductReasoner	Accuracy (%)	94.3	#2
Math Word Problem Solving	SVAMP	MsAT-DeductReasoner	Execution Accuracy	48.9	#13