Search Results for author: Weixiang Yan

Found 4 papers, 2 papers with code

CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification

no code implementations30 Apr 2024 Yuchen Tian, Weixiang Yan, Qian Yang, Qian Chen, Wen Wang, Ziyang Luo, Lei Ma

Large Language Models (LLMs) have made significant advancements in the field of code generation, offering unprecedented support for automated programming and assisting developers.

Code Generation Hallucination

CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation

1 code implementation14 Nov 2023 Weixiang Yan, Haitian Liu, Yunkun Wang, Yunzhe Li, Qian Chen, Wen Wang, Tingyu Lin, Weishan Zhao, Li Zhu, Shuiguang Deng, Hari Sundaram

To bridge these gaps between existing benchmarks and expectations from practical applications, we introduce CodeScope, an execution-based, multilingual, multi-task, multi-dimensional evaluation benchmark for comprehensively gauging LLM capabilities on coding tasks.

Code Generation

CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation

1 code implementation8 Oct 2023 Weixiang Yan, Yuchen Tian, Yunzhe Li, Qian Chen, Wen Wang

To advance research on code translation and meet diverse requirements of real-world applications, we construct CodeTransOcean, a large-scale comprehensive benchmark that supports the largest variety of programming languages for code translation.

Code Translation Machine Translation +1

Advancing Precise Outline-Conditioned Text Generation with Task Duality and Explicit Outline Control

no code implementations23 May 2023 Yunzhe Li, Qian Chen, Weixiang Yan, Wen Wang, Qinglin Zhang, Hari Sundaram

Furthermore, we identify an issue of imbalanced utilization of the outline information in the precise outline-conditioned generation, which is ubiquitously observed across fine-tuned models and zero-shot inference models.

Sentence Text Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.