1 code implementation • 23 Apr 2024 • Yifeng Ding, Jiawei Liu, Yuxiang Wei, Terry Yue Zhuo, Lingming Zhang
We introduce XFT, a simple yet powerful training scheme that merges upcycled Mixture-of-Experts (MoE) models to unleash the performance limit of instruction-tuned code Large Language Models (LLMs).
no code implementations • 14 Apr 2024 • Siyuan Feng, Jiawei Liu, Ruihang Lai, Charlie F. Ruan, Yong Yu, Lingming Zhang, Tianqi Chen
While a traditional bottom-up development pipeline fails to close the gap in a timely manner, we introduce TapML, a top-down approach and tooling designed to streamline the deployment of ML systems on diverse platforms, optimized for developer productivity.
1 code implementation • 28 Mar 2024 • Chunqiu Steven Xia, Yinlin Deng, Lingming Zhang
Such limitations inevitably lead us to inquire: Is the leaderboard performance on existing benchmarks reliable and comprehensive enough to measure the program synthesis ability of LLMs?
no code implementations • 29 Feb 2024 • Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, Evgenii Zheltonozhskii, Nii Osae Osae Dade, Wenhao Yu, Lucas Krauß, Naman Jain, Yixuan Su, Xuanli He, Manan Dey, Edoardo Abati, Yekun Chai, Niklas Muennighoff, Xiangru Tang, Muhtasham Oblokulov, Christopher Akiki, Marc Marone, Chenghao Mou, Mayank Mishra, Alex Gu, Binyuan Hui, Tri Dao, Armel Zebaze, Olivier Dehaene, Nicolas Patry, Canwen Xu, Julian McAuley, Han Hu, Torsten Scholak, Sebastien Paquet, Jennifer Robinson, Carolyn Jane Anderson, Nicolas Chapados, Mostofa Patwary, Nima Tajbakhsh, Yacine Jernite, Carlos Muñoz Ferrandis, Lingming Zhang, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries
Our large model, StarCoder2-15B, significantly outperforms other models of comparable size.
Ranked #25 on Code Generation on MBPP
no code implementations • 31 Dec 2023 • Chenyuan Yang, Zijie Zhao, Lingming Zhang
Bugs in operating system kernels can affect billions of devices and users all over the world.
1 code implementation • 4 Dec 2023 • Yuxiang Wei, Zhe Wang, Jiawei Liu, Yifeng Ding, Lingming Zhang
Magicoder models are trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets to generate high-quality instruction data for code.
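The core of OSS-Instruct is seeding an instruction-generation prompt with a real open-source code snippet so the LLM invents a grounded coding problem rather than a generic one. A minimal sketch of that seeding step, assuming a hypothetical prompt template and `build_prompt` helper (not Magicoder's actual code, and with the LLM call itself omitted):

```python
# Illustrative OSS-Instruct-style seeding: embed an open-source snippet
# into a prompt that asks an LLM to invent an instruction/solution pair.
# The template wording and helper name are assumptions for illustration.

PROMPT_TEMPLATE = """You are given a code snippet from an open-source project:

{snippet}

Inspired by this snippet, create a self-contained coding problem and its solution."""

def build_prompt(snippet: str) -> str:
    """Embed an OSS code snippet into an instruction-generation prompt."""
    return PROMPT_TEMPLATE.format(snippet=snippet)

seed_snippet = "def gcd(a, b):\n    while b:\n        a, b = b, a % b\n    return a"
prompt = build_prompt(seed_snippet)
print(seed_snippet in prompt)  # True -- the seed snippet is carried into the prompt
```

In the full pipeline, the returned prompt would be sent to a teacher LLM and the generated problem/solution pair collected as one of the 75K training examples.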
1 code implementation • 24 Oct 2023 • Chenyuan Yang, Yinlin Deng, Runyu Lu, Jiayi Yao, Jiawei Liu, Reyhaneh Jabbarvand, Lingming Zhang
Nonetheless, prompting LLMs with compiler source-code information remains a missing piece of research in compiler testing.
1 code implementation • 1 Sep 2023 • Yuxiang Wei, Chunqiu Steven Xia, Lingming Zhang
Therefore, we propose Repilot, a general code generation framework to further copilot the AI "copilots" (i.e., LLMs) by synthesizing more valid patches during the repair process.
1 code implementation • 9 Aug 2023 • Chunqiu Steven Xia, Matteo Paltenghi, Jia Le Tian, Michael Pradel, Lingming Zhang
Moreover, the inputs generated by existing fuzzers are often limited to specific features of the input language, and thus can hardly reveal bugs related to other or new features.
1 code implementation • NeurIPS 2023 • Jiawei Liu, Chunqiu Steven Xia, Yuyao Wang, Lingming Zhang
While EvalPlus is general, we extend the test-cases of the popular HumanEval benchmark by 80x to build HumanEval+.
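The motivation for augmenting test suites is that a solution can pass a benchmark's few original tests while still being wrong on unlisted inputs. A toy illustration of this effect, with a deliberately buggy candidate and made-up test cases (not drawn from HumanEval itself):

```python
# A subtly buggy candidate passes a sparse test suite but fails once the
# suite is augmented with more inputs -- the effect EvalPlus targets.
# All test cases below are illustrative, not actual HumanEval tests.

def candidate_is_prime(n: int) -> bool:
    # Bug: wrongly reports 1 as prime (empty range makes all() return True).
    return all(n % d for d in range(2, n))

base_tests = [(5, True), (6, False), (13, True)]               # sparse original suite
extra_tests = base_tests + [(1, False), (2, True), (9, False)] # augmented suite

def passes(tests):
    return all(candidate_is_prime(n) == expected for n, expected in tests)

print(passes(base_tests))   # True  -- the bug slips through the small suite
print(passes(extra_tests))  # False -- the augmented inputs expose it
```

Scaling this idea up, an 80x larger test suite catches many such "plausible but wrong" completions that the original benchmark would have scored as correct.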
no code implementations • 1 Apr 2023 • Chunqiu Steven Xia, Lingming Zhang
For earlier patches that failed to pass all tests, we combine the incorrect patches with their corresponding relevant test failure information to construct a new prompt for the LLM to generate the next patch.
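The feedback loop above can be sketched as prompt construction that folds every failed patch and its test failure back into the next query. This is a minimal illustration with the LLM call stubbed out; the prompt wording and helper name are assumptions, not the paper's actual implementation:

```python
# Sketch of conversational repair: each (failed patch, test failure) pair
# from earlier rounds is appended to the next prompt, so the model can
# avoid repeating mistakes. Prompt format is illustrative only.

def make_prompt(buggy_code, history):
    """Build a repair prompt that includes all prior failed attempts."""
    prompt = "Fix this bug:\n" + buggy_code + "\n"
    for patch, failure in history:
        prompt += "\nPrevious patch:\n" + patch + "\nIt failed with:\n" + failure + "\n"
    prompt += "\nProvide a new patch."
    return prompt

history = [("return a - b", "test_add: expected 3, got -1")]
prompt = make_prompt("def add(a, b): return a * b", history)
print("expected 3, got -1" in prompt)  # True -- failure feedback reaches the model
```

Each round, the newly generated patch is validated against the tests; on failure, its (patch, failure) pair is appended to `history` and the loop repeats.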
no code implementations • 18 Mar 2023 • Chunqiu Steven Xia, Yifeng Ding, Lingming Zhang
Traditional APR tools have largely leveraged the plastic surgery hypothesis by designing manual or heuristic-based approaches to exploit such existing code ingredients.
1 code implementation • 4 Feb 2023 • Jiawei Liu, Jinjun Peng, Yuyao Wang, Lingming Zhang
NeuRI finds 100 new bugs for PyTorch and TensorFlow in four months, with 81 already fixed or confirmed.
no code implementations • 30 Jan 2023 • Chunqiu Steven Xia, Lingming Zhang
As such, we leverage the long-term context window of LLMs to not only avoid generating previously incorrect patches but also incorporate validation feedback to help the model understand the semantic meaning of the program under test.
1 code implementation • 26 Jul 2022 • Jiawei Liu, JinKun Lin, Fabian Ruffy, Cheng Tan, Jinyang Li, Aurojit Panda, Lingming Zhang
In this work, we propose a new fuzz testing approach for finding bugs in deep-learning compilers.
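The essence of such fuzzing is differential testing: generate many random valid programs and flag inputs where two implementations disagree. A toy loop under stand-in evaluators (simple arithmetic expressions rather than deep-learning models, and not the paper's actual tooling):

```python
# Toy differential-fuzzing loop: generate random small programs and compare
# a reference implementation against the system under test. Real compiler
# fuzzers generate valid DL computation graphs instead of arithmetic strings.
import random

def gen_expr(rng, depth=0):
    """Randomly generate a small arithmetic expression as a string."""
    if depth > 2 or rng.random() < 0.3:
        return str(rng.randint(0, 9))
    op = rng.choice(["+", "-", "*"])
    return f"({gen_expr(rng, depth + 1)} {op} {gen_expr(rng, depth + 1)})"

def reference_eval(expr):
    return eval(expr)  # trusted oracle

def optimized_eval(expr):
    return eval(expr)  # system under test (identical here, so no bug is found)

rng = random.Random(0)  # seeded for reproducibility
mismatches = [e for e in (gen_expr(rng) for _ in range(100))
              if reference_eval(e) != optimized_eval(e)]
print(len(mismatches))  # 0 -- both implementations agree on all fuzzed inputs
```

In a real deep-learning-compiler fuzzer, `optimized_eval` would be the compiled model and `reference_eval` an interpreter-mode run; any mismatch is a candidate miscompilation bug.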
1 code implementation • 19 Apr 2022 • Yue Zhao, Lingming Zhang, Yang Liu, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen
The state-of-the-art deep learning-based methods often simply concatenate the raw geometric attributes (i.e., coordinates and normal vectors) of mesh cells to train a single-stream network for automatic intra-oral scanner image segmentation.
1 code implementation • 21 Feb 2022 • Jiawei Liu, Yuxiang Wei, Sen Yang, Yinlin Deng, Lingming Zhang
Our results show that Tzer substantially outperforms existing fuzzing techniques on tensor compiler testing, with 75% higher coverage and 50% more valuable tests than the 2nd-best technique.
no code implementations • Pattern Recognition Letters 2021 • Yue Zhao, Lingming Zhang, Chongshi Yang, Yingyun Tan, Yang Liu, Pengcheng Li, Tianhao Huang, Chenqiang Gao
We have evaluated our network on a real-patient dataset of dental models acquired through 3D intraoral scanners, and experimental results show that our method outperforms state-of-the-art deep learning methods for 3D shape segmentation.
no code implementations • CVPR 2021 • Lingming Zhang, Yue Zhao, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen
State-of-the-art methods directly concatenate the raw attributes of 3D inputs, namely coordinates and normal vectors of mesh cells, to train a single-stream network for fully-automated tooth segmentation.
no code implementations • 27 Dec 2018 • Husheng Zhou, Wei Li, Yuankun Zhu, Yuqun Zhang, Bei Yu, Lingming Zhang, Cong Liu
Furthermore, DeepBillboard is sufficiently robust and resilient for generating physical-world adversarial billboard tests for real-world driving under various weather conditions.
1 code implementation • 7 Feb 2018 • Mengshi Zhang, Yuqun Zhang, Lingming Zhang, Cong Liu, Sarfraz Khurshid
In this paper, we propose DeepRoad, an unsupervised framework to automatically generate large amounts of accurate driving scenes to test the consistency of DNN-based autonomous driving systems across different scenes.