Search Results for author: Ian Arawjo

Found 4 papers, 1 paper with code

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

no code implementations • 18 Apr 2024 • Shreya Shankar, J. D. Zamfirescu-Pereira, Björn Hartmann, Aditya G. Parameswaran, Ian Arawjo

In particular, we identify a phenomenon we dub "criteria drift": users need criteria to grade outputs, but grading outputs helps users define criteria.

Antagonistic AI

no code implementations • 12 Feb 2024 • Alice Cai, Ian Arawjo, Elena L. Glassman

The vast majority of discourse around AI development assumes that subservient, "moral" models aligned with "human values" are universally beneficial -- in short, that good AI is sycophantic AI.

ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

1 code implementation • 17 Sep 2023 • Ian Arawjo, Chelse Swoopes, Priyan Vaithilingam, Martin Wattenberg, Elena Glassman

Evaluating outputs of large language models (LLMs) is challenging, requiring making -- and making sense of -- many responses.

Tasks: Model Selection, Prompt Engineering, +1
