Flickr30k-CNA (Flickr30k-Chinese All)

Introduced by Xie et al. in CCMB: A Large-scale Chinese Cross-modal Benchmark

The earlier Flickr30k-CN dataset translates the training and validation sets of Flickr30k with machine translation and translates only the test set manually. We checked the machine-translated results and found two kinds of problems: (1) some sentences contain language problems and translation errors; (2) some sentences have poor semantics. In addition, the mismatch between how the training set and the test set were translated prevents models from achieving accurate performance. We therefore gathered 6 professional English and Chinese linguists to meticulously re-translate all of the Flickr30k data and double-check each sentence.

Benchmarks

Task	Dataset Variant	Best Model
Zero-shot Image Retrieval	Flickr30k-CN	M2-Encoder
Image Retrieval	Flickr30k-CN	InternVL-G-FT
Zero-shot Text Retrieval	Flickr30k-CN	Alt-CLIP

Similar Datasets

IQM

ICM

ImageNet_CN

COCO-CN
