Search Results for author: Zixian Ma

Found 7 papers, 6 papers with code

m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

1 code implementation17 Mar 2024 Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna

With m&m's, we evaluate 6 popular LLMs with 2 planning strategies (multi-step vs. step-by-step planning), 2 plan formats (JSON vs. code), and 3 types of feedback (parsing/verification/execution).

4k

SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality

1 code implementation NeurIPS 2023 Cheng-Yu Hsieh, Jieyu Zhang, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna

In the last year alone, a surge of new benchmarks to measure compositional understanding of vision-language models have permeated the machine learning ecosystem.

Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design

1 code implementation6 Mar 2023 Michelle S. Lam, Zixian Ma, Anne Li, Izequiel Freitas, Dakuo Wang, James A. Landay, Michael S. Bernstein

Machine learning practitioners often end up tunneling on low-level technical details like model architectures and performance metrics.

Decision Making

ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward

1 code implementation9 Oct 2022 Zixian Ma, Rose Wang, Li Fei-Fei, Michael Bernstein, Ranjay Krishna

These results identify tasks where expectation alignment is a more useful strategy than curiosity-driven exploration for multi-agent coordination, enabling agents to do zero-shot coordination.

Multi-agent Reinforcement Learning

MobilePhys: Personalized Mobile Camera-Based Contactless Physiological Sensing

no code implementations11 Jan 2022 Xin Liu, Yuntao Wang, Sinan Xie, XiaoYu Zhang, Zixian Ma, Daniel McDuff, Shwetak Patel

Camera-based contactless photoplethysmography refers to a set of popular techniques for contactless physiological measurement.

Cannot find the paper you are looking for? You can Submit a new open access paper.