1 code implementation • 11 May 2023 • Onur Galoğlu, Robert Litschko, Goran Glavaš
While a large body of work leveraged MMTs to mine parallel data and induce bilingual document embeddings, much less effort has been devoted to training general-purpose (massively) multilingual document encoder that can be used for both supervised and unsupervised document-level tasks.