Search Results for author: Mingzhe Xing

Found 1 paper, 1 paper with code

Understanding the Weakness of Large Language Model Agents within a Complex Android Environment

1 code implementation · 9 Feb 2024 · Mingzhe Xing, Rongkai Zhang, Hui Xue, Qi Chen, Fan Yang, Zhen Xiao

These challenges motivate AndroidArena, an environment and benchmark designed to evaluate LLM agents on a modern operating system.

Date Understanding · Language Modelling · +1
