AesBench

Introduced by Huang et al. in AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics Perception

AesBench is an expert benchmark designed to comprehensively evaluate the aesthetic perception capacities of Multimodal Large Language Models (MLLMs) when it comes to image aesthetics perception. Let me break it down for you:

Purpose and Challenge:
MLLMs, which combine language and vision, are rapidly advancing.
However, their performance in aesthetic perception (assessing the beauty or visual appeal of images) remains uncertain.
The lack of a specific benchmark for evaluating MLLMs in this domain hinders their further development.
What Is AesBench?:
AesBench addresses this challenge by providing a comprehensive benchmark.
It evaluates MLLMs' aesthetic perception abilities through dual facets:
- Expert-labeled Aesthetics Perception Database (EAPD): This database contains diverse image contents with high-quality annotations from professional aesthetic experts.
- Integrative Criteria: AesBench proposes criteria to measure MLLMs' aesthetic perception abilities from four perspectives:
- Perception (AesP): How well MLLMs perceive aesthetics.
- Empathy (AesE): Their ability to empathize with aesthetic preferences.
- Assessment (AesA): How accurately they assess aesthetics.
- Interpretation (AesI): Their understanding of aesthetic features.
Findings:
Extensive experiments reveal that current MLLMs possess only rudimentary aesthetic perception ability.
There remains a significant gap between MLLMs and human aesthetic perception.

In summary, AesBench provides a valuable tool for assessing how well MLLMs understand and appreciate the beauty of images. 📸🌟

(1) [2401.08276] AesBench: An Expert Benchmark for Multimodal Large .... https://arxiv.org/abs/2401.08276. (2) GitHub - yipoh/AesBench: An expert benchmark aiming to comprehensively .... https://github.com/yipoh/AesBench. (3) AesBench/README.md at main · yipoh/AesBench · GitHub. https://github.com/yipoh/AesBench/blob/main/README.md. (4) undefined. https://doi.org/10.48550/arXiv.2401.08276.

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Similar Datasets

Usage

License

Unknown

AesBench

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

SPAQ

ShareGPT4V

MVBench

MMVP

Usage

License

Modalities

Languages

AesBench

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

SPAQ

ShareGPT4V

MVBench

MMVP

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages