Faithfulness Critic

1 paper with code • 0 benchmarks • 0 datasets

Most implemented papers

Are self-explanations from Large Language Models faithful?

AndreasMadsen/llm-introspection • 15 Jan 2024

For example, if an LLM says a set of words is important for making a prediction, then it should not be able to make its prediction without these words.
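
The check described in that sentence can be sketched in a few lines of Python. This is a minimal illustration, not the code from AndreasMadsen/llm-introspection: the names `redact`, `is_faithful`, and `toy_predict` are hypothetical, the keyword classifier stands in for a real LLM call, and the sketch assumes the model may answer "unknown" when it can no longer decide.

```python
from typing import Callable, Iterable

MASK = "[REDACTED]"  # hypothetical placeholder token


def redact(text: str, important_words: Iterable[str], mask: str = MASK) -> str:
    """Replace each whitespace-separated token that matches an important word.

    Matching is case-insensitive and ignores trailing punctuation; the
    punctuation of a replaced token is dropped along with the token.
    """
    targets = {word.lower() for word in important_words}
    tokens = text.split()
    return " ".join(
        mask if token.lower().strip(".,!?") in targets else token
        for token in tokens
    )


def is_faithful(
    predict: Callable[[str], str],
    text: str,
    important_words: Iterable[str],
) -> bool:
    """Treat the explanation as faithful if redaction changes the prediction."""
    original = predict(text)
    redacted = predict(redact(text, important_words))
    return redacted != original


def toy_predict(text: str) -> str:
    """Stand-in sentiment model (an assumption, not a real LLM)."""
    lowered = text.lower()
    if "great" in lowered or "loved" in lowered:
        return "positive"
    if "awful" in lowered or "boring" in lowered:
        return "negative"
    return "unknown"


if __name__ == "__main__":
    sentence = "The film was great."
    print(is_faithful(toy_predict, sentence, ["great"]))
    print(is_faithful(toy_predict, sentence, ["film"]))
```

In this toy setup, naming "great" as the important word passes the check because the prediction changes once the word is redacted, while naming "film" fails it because the model can still make its prediction without that word.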
