1 code implementation • 30 Nov 2023 • Zicong Fan, Maria Parelli, Maria Eleni Kadoglou, Muhammed Kocabas, Xu Chen, Michael J. Black, Otmar Hilliges
Since humans interact with diverse objects every day, the holistic 3D capture of these interactions is important to understand and model human behaviour.
no code implementations • 7 Sep 2023 • Maria Parelli, Dimitrios Mallis, Markos Diomataris, Vassilis Pitsikalis
Transformer-based architectures have recently demonstrated remarkable performance in the Visual Question Answering (VQA) task.
no code implementations • 4 Jun 2023 • Alexandros Delitzas, Maria Parelli, Nikolas Hars, Georgios Vlassis, Sotirios Anagnostidis, Gregor Bachmann, Thomas Hofmann
Training models to apply common-sense linguistic knowledge and visual concepts from 2D images to 3D scene understanding is a promising direction that researchers have only recently started to explore.
1 code implementation • 12 Apr 2023 • Maria Parelli, Alexandros Delitzas, Nikolas Hars, Georgios Vlassis, Sotirios Anagnostidis, Gregor Bachmann, Thomas Hofmann
Training models to apply linguistic knowledge and visual concepts from 2D images to 3D world understanding is a promising direction that researchers have only recently started to explore.