MuMu is a new dataset of more than 31k albums classified into 250 genre classes.
3 PAPERS • NO BENCHMARKS YET
PTVD is a plot-oriented multimodal dataset in the TV domain. It is also the first non-English dataset of its kind. Additionally, PTVD contains more than 26 million bullet screen comments (BSCs), powering large-scale pre-training.
1 PAPER • NO BENCHMARKS YET
Trailers12k is a movie trailer dataset comprised of 12,000 titles associated to ten genres. It distinguishes from other datasets by its collection procedure aimed at providing a high-quality publicly available dataset.