Search Results for author: Rajat Koner

Found 12 papers, 9 papers with code

GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation

1 code implementation • 26 May 2023 • Tanveer Hannan, Rajat Koner, Maximilian Bernhard, Suprosanna Shit, Bjoern Menze, Volker Tresp, Matthias Schubert, Thomas Seidl

Secondly, we propose a novel inter-instance interaction using gate activation as a mask for self-attention.

Ranked #4 on Video Instance Segmentation on YouTube-VIS 2021 (using extra training data)

Instance Segmentation Semantic Segmentation +1

Paper
Code

Do DALL-E and Flamingo Understand Each Other?

no code implementations • ICCV 2023 • Hang Li, Jindong Gu, Rajat Koner, Sahand Sharifzadeh, Volker Tresp

To study this question, we propose a reconstruction task where Flamingo generates a description for a given image and DALL-E uses this description as input to synthesize a new image.

Image Captioning Image Reconstruction +3

Paper
Add Code

InstanceFormer: An Online Video Instance Segmentation Framework

1 code implementation • 22 Aug 2022 • Rajat Koner, Tanveer Hannan, Suprosanna Shit, Sahand Sharifzadeh, Matthias Schubert, Thomas Seidl, Volker Tresp

We propose three novel components to model short-term and long-term dependency and temporal coherence.

Ranked #5 on Video Instance Segmentation on Youtube-VIS 2022 Validation (using extra training data)

Decoder Instance Segmentation +2

Paper
Code

Relationformer: A Unified Framework for Image-to-Graph Generation

1 code implementation • 19 Mar 2022 • Suprosanna Shit, Rajat Koner, Bastian Wittmann, Johannes Paetzold, Ivan Ezhov, Hongwei Li, Jiazhen Pan, Sahand Sharifzadeh, Georgios Kaissis, Volker Tresp, Bjoern Menze

We leverage direct set-based object prediction and incorporate the interaction among the objects to learn an object-relation representation jointly.

Graph Generation Object +4

Paper
Code

Is it all a cluster game? -- Exploring Out-of-Distribution Detection based on Clustering in the Embedding Space

no code implementations • 16 Mar 2022 • Poulami Sinhamahapatra, Rajat Koner, Karsten Roscher, Stephan Günnemann

It is essential for safety-critical applications of deep neural networks to determine when new inputs are significantly different from the training distribution.

Contrastive Learning Out-of-Distribution Detection +1

Paper
Add Code

Box Supervised Video Segmentation Proposal Network

1 code implementation • 14 Feb 2022 • Tanveer Hannan, Rajat Koner, Jonathan Kobold, Matthias Schubert

Video Object Segmentation (VOS) has been targeted by various fully-supervised and self-supervised approaches.

Image Segmentation Motion Compensation +6

Paper
Code

OODformer: Out-Of-Distribution Detection Transformer

1 code implementation • 19 Jul 2021 • Rajat Koner, Poulami Sinhamahapatra, Karsten Roscher, Stephan Günnemann, Volker Tresp

A serious problem in image classification is that a trained model might perform well for input data that originates from the same distribution as the data available for model training, but performs much worse for out-of-distribution (OOD) samples.

Contrastive Learning Out-of-Distribution Detection +1

Paper
Code

Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering

1 code implementation • 13 Jul 2021 • Rajat Koner, Hang Li, Marcel Hildebrandt, Deepan Das, Volker Tresp, Stephan Günnemann

We conduct an experimental study on the challenging dataset GQA, based on both manually curated and automatically generated scene graphs.

Navigate Question Answering +1

Paper
Code

Scenes and Surroundings: Scene Graph Generation using Relation Transformer

1 code implementation • 12 Jul 2021 • Rajat Koner, Poulami Sinhamahapatra, Volker Tresp

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content.

Graph Generation Object +2

Paper
Code

Scene Graph Reasoning for Visual Question Answering

no code implementations • 2 Jul 2020 • Marcel Hildebrandt, Hang Li, Rajat Koner, Volker Tresp, Stephan Günnemann

We propose a novel method that approaches the task by performing context-driven, sequential reasoning based on the objects and their semantic and spatial relationships present in the scene.

Navigate Question Answering +1

Paper
Add Code

Relation Transformer Network

2 code implementations • 13 Apr 2020 • Rajat Koner, Suprosanna Shit, Volker Tresp

In this work, we propose a novel transformer formulation for scene graph generation and relation prediction.

Decoder Graph Generation +3

Paper
Code

Improving Visual Relation Detection using Depth Maps

1 code implementation • 2 May 2019 • Sahand Sharifzadeh, Sina Moayed Baharlou, Max Berrendorf, Rajat Koner, Volker Tresp

We argue that depth maps can additionally provide valuable information on object relations, e. g. helping to detect not only spatial relations, such as standing behind, but also non-spatial relations, such as holding.

Ranked #1 on Relationship Detection on VRD

Object Relation +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.