HRS-Bench (Holistic, Reliable, and Scalable Benchmark)

Introduced by BAKR et al. in HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models

HRS-Bench is a concrete evaluation benchmark for T2I models that is Holistic, Reliable, and Scalable. It measures 13 skills that can be categorized into five major categories: accuracy, robustness, generalization, fairness, and bias. In addition, HRS-Bench covers 50 scenarios, including fashion, animals, transportation, food, and clothes.

Source: HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models

Homepage