1 code implementation • 15 Jan 2024 • Jakob Hackstein, Gencer Sumbul, Kai Norman Clasen, Begüm Demir
We finally derive a guideline to exploit masked image modeling for uni-modal and cross-modal CBIR problems in RS.
no code implementations • 2 Jun 2023 • David Hoffmann, Kai Norman Clasen, Begüm Demir
In this paper, we introduce a novel Synchronized Class Token Fusion (SCT Fusion) architecture in the framework of multi-modal multi-label classification (MLC) of remote sensing (RS) images.
no code implementations • 1 Jun 2023 • Leonard Hackel, Kai Norman Clasen, Mahdyar Ravanbakhsh, Begüm Demir
Visual question answering (VQA) methods in remote sensing (RS) aim to answer natural language questions with respect to an RS image.
no code implementations • 10 Oct 2022 • Tim Siebert, Kai Norman Clasen, Mahdyar Ravanbakhsh, Begüm Demir
To make the intrinsic information of each RS image easily accessible, visual question answering (VQA) has been introduced in RS.