Natural Language Processing

multilingual cross-modal retrieval

2 papers with code • 0 benchmarks • 0 datasets

The task of multilingual cross-modal retrieval contains image-text retrieval tasks on different languages.

Benchmarks

Add a Result

These leaderboards are used to track progress in multilingual cross-modal retrieval

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Most implemented papers

Most implemented Social Latest No code

mCLIP: Multilingual CLIP via Cross-lingual Transfer

ghchen18/acl23_mclip • • ACL 2023

Furthermore, to enhance the token- and sentence-level multilingual representation of the MTE, we propose to train it with machine translation and contrastive learning jointly before the TriKD to provide a better initialization.

Paper
Code

PaLI-3 Vision Language Models: Smaller, Faster, Stronger

kyegomez/PALI3 • • 13 Oct 2023

This paper presents PaLI-3, a smaller, faster, and stronger vision language model (VLM) that compares favorably to similar models that are 10x larger.