HRS-Bench (Holistic, Reliable, and Scalable Benchmark)

Introduced by Bakr et al. in HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models

HRS-Bench is an evaluation benchmark for text-to-image (T2I) models designed to be holistic, reliable, and scalable. It measures 13 skills grouped into five major categories: accuracy, robustness, generalization, fairness, and bias. It also covers 50 scenarios, including fashion, animals, transportation, food, and clothes.

Source: HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models

