Poses of People in Art: A Data Set for Human Pose Estimation in Digital Art History

12 Jan 2023  ยท  Stefanie Schneider, Ricarda Vollmer ยท

Throughout the history of art, the pose, as the holistic abstraction of the human body's expression, has proven to be a constant in numerous studies. However, due to the enormous amount of data that so far had to be processed by hand, its crucial role to the formulaic recapitulation of art-historical motifs since antiquity could only be highlighted selectively. This is true even for the now automated estimation of human poses, as domain-specific, sufficiently large data sets required for training computational models are either not publicly available or not indexed at a fine enough granularity. With the Poses of People in Art data set, we introduce the first openly licensed data set for estimating human poses in art and validating human pose estimators. It consists of 2,454 images from 22 art-historical depiction styles, including those that have increasingly turned away from lifelike representations of the body since the 19th century. A total of 10,749 human figures are precisely enclosed by rectangular bounding boxes, with a maximum of four per image labeled by up to 17 keypoints; among these are mainly joints such as elbows and knees. For machine learning purposes, the data set is divided into three subsets, training, validation, and testing, that follow the established JSON-based Microsoft COCO format, respectively. Each image annotation, in addition to mandatory fields, provides metadata from the art-historical online encyclopedia WikiArt. With this paper, we elaborate on the acquisition and constitution of the data set, address various application scenarios, and discuss prospects for a digitally supported art history. We show that the data set enables the investigation of body phenomena in art, whether at the level of individual figures, which can be captured in their subtleties, or entire figure constellations, whose position, distance, or proximity to one another is considered.

PDF Abstract

Results from the Paper


 Ranked #1 on Object Detection on PeopleArt (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Benchmark
Object Detection PeopleArt TOOD (Task-aligned One-stage Object Detection; trained on PeopleArt and PoPArt) mAP 47.8 # 2
mAP@0.5 78.0 # 2
mAP@0.75 49.9 # 2
Object Detection PeopleArt PVT (Pyramid Vision Transformer; trained on PeopleArt and PopArt) mAP 49.7 # 1
mAP@0.5 80.5 # 1
mAP@0.75 51.8 # 1
Object Detection PeopleArt TOOD (Task-aligned One-stage Object Detection) mAP 46.1 # 4
mAP@0.5 75.0 # 4
mAP@0.75 49.0 # 3
Object Detection PeopleArt PVT (Pyramid Vision Transformer) mAP 46.5 # 3
mAP@0.5 76.0 # 3
mAP@0.75 48.4 # 4

Methods


No methods listed for this paper. Add relevant methods here