1 code implementation • 27 Apr 2024 • Zeyang Ma, An Ran Chen, Dong Jae Kim, Tse-Hsun Chen, Shaowei Wang
Our evaluation of 16 open-source systems shows that LLMParser achieves statistically significantly higher parsing accuracy than state-of-the-art parsers (a 96% average parsing accuracy).
no code implementations • 23 Mar 2024 • Feng Lin, Dong Jae Kim, Tse-Husn, Chen
Results indicate LCGScrum outperforms other models, achieving Pass@1 scores of 75. 2, 65. 5, 82. 5, and 56. 7 in HumanEval, HumanEval-ET, MBPP, and MBPP-ET, respectively - an average 15% improvement over GPT.
Ranked #6 on Code Generation on MBPP