Synthetic Grade School Math (SGSM)

Introduced by Christ et al. in MATHWELL: Generating Age-Appropriate Educational Math Word Problems

SGSM contains 20,490 question/answer pairs generated by MATHWELL, a context-free grade school math word problem generator that outputs a word problem and Program of Thought (PoT) solution based solely on an optional student interest. SGSM has two subsets: SGSM Train, comprised of 2,093 question/answer pairs verified by human experts, and SGSM Unannotated, comprised of 18,397 question/answer pairs that have executable code but are not verified by human experts. SGSM is the largest English grade school math QA dataset with PoT rationales.

SGSM is designed to train context-free grade school math word problem generators, but can also be used to train math QA models.

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


Modalities


Languages