Texts

HumanEval-X

Introduced by Zheng et al. in CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X

HumanEval-X is a benchmark for evaluating the multilingual ability of code generative models. It consists of 820 high-quality human-crafted data samples (each with test cases) in Python, C++, Java, JavaScript, and Go, and can be used for various tasks, such as code generation and translation.

Homepage