Search Results for author: Zixian Ma

Found 7 papers, 6 papers with code

m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

1 code implementation • 17 Mar 2024 • Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna

With m&m's, we evaluate 6 popular LLMs with 2 planning strategies (multi-step vs. step-by-step planning), 2 plan formats (JSON vs. code), and 3 types of feedback (parsing/verification/execution).

Paper
Code

SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality

1 code implementation • NeurIPS 2023 • Cheng-Yu Hsieh, Jieyu Zhang, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna

In the last year alone, a surge of new benchmarks to measure compositional understanding of vision-language models have permeated the machine learning ecosystem.

Paper
Code

Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design

1 code implementation • 6 Mar 2023 • Michelle S. Lam, Zixian Ma, Anne Li, Izequiel Freitas, Dakuo Wang, James A. Landay, Michael S. Bernstein

Machine learning practitioners often end up tunneling on low-level technical details like model architectures and performance metrics.

Decision Making

Paper
Code

CREPE: Can Vision-Language Foundation Models Reason Compositionally?

1 code implementation • CVPR 2023 • Zixian Ma, Jerry Hong, Mustafa Omer Gul, Mona Gandhi, Irena Gao, Ranjay Krishna

To measure systematicity, CREPE consists of a test dataset containing over $370K$ image-text pairs and three different seen-unseen splits.

Ranked #1 on Image Retrieval on CREPE (Compositional REPresentation Evaluation)

Image Retrieval Negation +1

Paper
Code

ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward

1 code implementation • 9 Oct 2022 • Zixian Ma, Rose Wang, Li Fei-Fei, Michael Bernstein, Ranjay Krishna

These results identify tasks where expectation alignment is a more useful strategy than curiosity-driven exploration for multi-agent coordination, enabling agents to do zero-shot coordination.

Multi-agent Reinforcement Learning

Paper
Code

MobilePhys: Personalized Mobile Camera-Based Contactless Physiological Sensing

no code implementations • 11 Jan 2022 • Xin Liu, Yuntao Wang, Sinan Xie, XiaoYu Zhang, Zixian Ma, Daniel McDuff, Shwetak Patel

Camera-based contactless photoplethysmography refers to a set of popular techniques for contactless physiological measurement.

Paper
Add Code

OpenAttack: An Open-source Textual Adversarial Attack Toolkit

1 code implementation • ACL 2021 • Guoyang Zeng, Fanchao Qi, Qianrui Zhou, Tingji Zhang, Zixian Ma, Bairu Hou, Yuan Zang, Zhiyuan Liu, Maosong Sun

Textual adversarial attacking has received wide and increasing attention in recent years.

Adversarial Attack

653

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.