Search Results for author: Martin Danelljan

Found 96 papers, 66 papers with code

Efficient Video Object Segmentation via Modulated Cross-Attention Memory

1 code implementation • 26 Mar 2024 • Abdelrahman Shaker, Syed Talal Wasim, Martin Danelljan, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan

Recently, transformer-based approaches have shown promising results for semi-supervised video object segmentation.

Object Segmentation +3

Paper
Code

Analyzing Local Representations of Self-supervised Vision Transformers

no code implementations • 31 Dec 2023 • Ani Vanyan, Alvard Barseghyan, Hakob Tamazyan, Vahan Huroyan, Hrant Khachatrian, Martin Danelljan

In this paper, we present a comparative analysis of various self-supervised Vision Transformers (ViTs), focusing on their local representative power.

Contrastive Learning Few-Shot Semantic Segmentation +2

Paper
Add Code

Diffusion-Based Particle-DETR for BEV Perception

no code implementations • 18 Dec 2023 • Asen Nachkov, Martin Danelljan, Danda Pani Paudel, Luc van Gool

For the enhanced safety of AVs, modeling perception uncertainty in BEV is crucial.

Autonomous Vehicles Object +2

Paper
Add Code

Gaussian Grouping: Segment and Edit Anything in 3D Scenes

1 code implementation • 1 Dec 2023 • Mingqiao Ye, Martin Danelljan, Fisher Yu, Lei Ke

To address this issue, we propose Gaussian Grouping, which extends Gaussian Splatting to jointly reconstruct and segment anything in open-world 3D scenes.

Colorization Novel View Synthesis +2

423

Paper
Code

Probabilistic Sampling of Balanced K-Means using Adiabatic Quantum Computing

no code implementations • 18 Oct 2023 • Jan-Nico Zaech, Martin Danelljan, Luc van Gool

Adiabatic quantum computing (AQC) is a promising quantum computing approach for discrete and often NP-hard optimization problems.

Clustering

Paper
Add Code

R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras

no code implementations • ICCV 2023 • Aron Schmied, Tobias Fischer, Martin Danelljan, Marc Pollefeys, Fisher Yu

We propose R3D3, a multi-camera system for dense 3D reconstruction and ego-motion estimation.

3D Reconstruction Autonomous Driving +4

Paper
Add Code

MolGrapher: Graph-based Visual Recognition of Chemical Structures

1 code implementation • ICCV 2023 • Lucas Morin, Martin Danelljan, Maria Isabel Agea, Ahmed Nassar, Valery Weber, Ingmar Meijer, Peter Staar, Fisher Yu

In addition, we introduce a large-scale benchmark of annotated real molecule images, USPTO-30K, to spur research on this critical topic.

Paper
Code

Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects

1 code implementation • 6 Aug 2023 • Chunming He, Kai Li, Yachao Zhang, Yulun Zhang, Zhenhua Guo, Xiu Li, Martin Danelljan, Fisher Yu

On the prey side, we propose an adversarial training framework, Camouflageator, which introduces an auxiliary generator to generate more camouflaged objects that are harder for a COD method to detect.

object-detection Object Detection

Paper
Code

Cascade-DETR: Delving into High-Quality Universal Object Detection

1 code implementation • ICCV 2023 • Mingqiao Ye, Lei Ke, Siyuan Li, Yu-Wing Tai, Chi-Keung Tang, Martin Danelljan, Fisher Yu

While dominating on the COCO benchmark, recent Transformer-based detection methods are not competitive in diverse domains.

Object object-detection +2

Paper
Code

Prompting Diffusion Representations for Cross-Domain Semantic Segmentation

no code implementations • 5 Jul 2023 • Rui Gong, Martin Danelljan, Han Sun, Julio Delgado Mangas, Luc van Gool

Intrigued by this result, we set out to explore how well diffusion-pretrained representations generalize to new domains, a crucial ability for any representation.

Domain Generalization Image Generation +2

Paper
Add Code

Segment Anything Meets Point Tracking

1 code implementation • 3 Jul 2023 • Frano Rajič, Lei Ke, Yu-Wing Tai, Chi-Keung Tang, Martin Danelljan, Fisher Yu

The Segment Anything Model (SAM) has established itself as a powerful zero-shot image segmentation model, enabled by efficient point-centric annotation and prompt-based models.

Interactive Video Object Segmentation Object +5

905

Paper
Code

Segment Anything in High Quality

2 code implementations • NeurIPS 2023 • Lei Ke, Mingqiao Ye, Martin Danelljan, Yifan Liu, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

HQ-SAM is only trained on the introduced detaset of 44k masks, which takes only 4 hours on 8 GPUs.

Ranked #1 on Zero-Shot Instance Segmentation on LVIS v1.0 val

Zero-Shot Instance Segmentation Zero Shot Segmentation

13,472

Paper
Code

StyleGenes: Discrete and Efficient Latent Distributions for GANs

no code implementations • 30 Apr 2023 • Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Radu Timofte, Martin Danelljan, Luc van Gool

Thus, by independently sampling a variant for each gene and combining them into the final latent vector, our approach can represent a vast number of unique latent samples from a compact set of learnable parameters.

Disentanglement

Paper
Add Code

OVTrack: Open-Vocabulary Multiple Object Tracking

1 code implementation • CVPR 2023 • Siyuan Li, Tobias Fischer, Lei Ke, Henghui Ding, Martin Danelljan, Fisher Yu

This leaves contemporary MOT methods limited to a small set of pre-defined object categories.

Denoising Hallucination +4

Paper
Code

Mask-Free Video Instance Segmentation

1 code implementation • CVPR 2023 • Lei Ke, Martin Danelljan, Henghui Ding, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

A consistency loss is then enforced on the found matches.

Ranked #1 on Video Instance Segmentation on Youtube-VIS (trained with no video masks)

Instance Segmentation Optical Flow Estimation +4

350

Paper
Code

NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions

1 code implementation • 22 Mar 2023 • Mohamad Shahbazi, Evangelos Ntavelis, Alessio Tonioni, Edo Collins, Danda Pani Paudel, Martin Danelljan, Luc van Gool

Pose-conditioned convolutional generative models struggle with high-quality 3D-consistent image generation from single-view datasets, due to their lack of sufficient 3D priors.

Image Generation Inductive Bias

Paper
Code

How Reliable is Your Regression Model's Uncertainty Under Real-World Distribution Shifts?

1 code implementation • 7 Feb 2023 • Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

We then employ our benchmark to evaluate many of the most common uncertainty estimation methods, as well as two state-of-the-art uncertainty scores from the task of out-of-distribution detection.

Out-of-Distribution Detection regression

Paper
Code

Continuous Pseudo-Label Rectified Domain Adaptive Semantic Segmentation With Implicit Neural Representations

no code implementations • CVPR 2023 • Rui Gong, Qin Wang, Martin Danelljan, Dengxin Dai, Luc van Gool

Unsupervised domain adaptation (UDA) for semantic segmentation aims at improving the model performance on the unlabeled target domain by leveraging a labeled source domain.

Pseudo Label Semantic Segmentation +1

Paper
Add Code

Beyond SOT: Tracking Multiple Generic Objects at Once

1 code implementation • 22 Dec 2022 • Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc van Gool, Alina Kuznetsova

Our approach achieves a 4x faster run-time in case of 10 concurrent objects compared to tracking each object independently and outperforms existing single object trackers on our new benchmark.

Attribute Object +1

3,086

Paper
Code

Fast Hierarchical Learning for Few-Shot Object Detection

no code implementations • 10 Oct 2022 • Yihang She, Goutam Bhat, Martin Danelljan, Fisher Yu

These approaches however suffer from ``catastrophic forgetting'' issue due to finetuning of base detector, leading to sub-optimal performance on the base classes.

Few-Shot Object Detection Object +2

Paper
Add Code

ManiFlow: Implicitly Representing Manifolds with Normalizing Flows

no code implementations • 18 Aug 2022 • Janis Postels, Martin Danelljan, Luc van Gool, Federico Tombari

In contrast to prior work, we approach this problem by generating samples from the original data distribution given full knowledge about the perturbed distribution and the noise model.

Surface Reconstruction

Paper
Add Code

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility

1 code implementation • 14 Aug 2022 • Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Akshay Dudhane, Martin Danelljan, Hisham Cholakkal, Salman Khan, Luc van Gool, Fahad Shahbaz Khan

While being greatly benefiting to the tracking research, existing benchmarks do not pose the same difficulty as before with recent trackers achieving higher performance mainly due to (i) the introduction of more sophisticated transformers-based methods and (ii) the lack of diverse scenarios with adverse visibility such as, severe weather conditions, camouflage and imaging effects.

Visual Object Tracking Visual Tracking

3,086

Paper
Code

Video Mask Transfiner for High-Quality Video Instance Segmentation

1 code implementation • 28 Jul 2022 • Lei Ke, Henghui Ding, Martin Danelljan, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

While Video Instance Segmentation (VIS) has seen rapid progress, current approaches struggle to predict high-quality masks with accurate boundary details.

Ranked #1 on Video Instance Segmentation on HQ-YTVIS

Instance Segmentation Semantic Segmentation +2

Paper
Code

Tracking Every Thing in the Wild

1 code implementation • 26 Jul 2022 • Siyuan Li, Martin Danelljan, Henghui Ding, Thomas E. Huang, Fisher Yu

Our experiments show that TETA evaluates trackers more comprehensively, and TETer achieves significant improvements on the challenging large-scale datasets BDD100K and TAO compared to the state-of-the-art.

Ranked #4 on Multi-Object Tracking on TAO

Benchmarking Classification +2

Paper
Code

Arbitrary-Scale Image Synthesis

1 code implementation • CVPR 2022 • Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Radu Timofte, Martin Danelljan, Luc van Gool

Positional encodings have enabled recent works to train a single adversarial network that can generate images of different scales.

Image Generation

Paper
Code

Robust Visual Tracking by Segmentation

2 code implementations • 21 Mar 2022 • Matthieu Paul, Martin Danelljan, Christoph Mayer, Luc van Gool

We infer a bounding box from the segmentation mask, validate our tracker on challenging tracking datasets and achieve the new state of the art on LaSOT with a success AUC score of 69. 7%.

Segmentation Semantic Segmentation +4

3,086

Paper
Code

Transforming Model Prediction for Tracking

1 code implementation • CVPR 2022 • Christoph Mayer, Martin Danelljan, Goutam Bhat, Matthieu Paul, Danda Pani Paudel, Fisher Yu, Luc van Gool

Optimization based tracking methods have been widely successful by integrating a target model prediction module, providing effective global reasoning by minimizing an objective function.

Ranked #20 on Visual Object Tracking on LaSOT (Precision metric)

Inductive Bias Visual Object Tracking

3,086

Paper
Code

Transform your Smartphone into a DSLR Camera: Learning the ISP in the Wild

no code implementations • 20 Mar 2022 • Ardhendu Shekhar Tripathi, Martin Danelljan, Samarth Shukla, Radu Timofte, Luc van Gool

We propose a trainable Image Signal Processing (ISP) framework that produces DSLR quality images given RAW images captured by a smartphone.

Motion Estimation

Paper
Add Code

Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences

1 code implementation • CVPR 2022 • Prune Truong, Martin Danelljan, Fisher Yu, Luc van Gool

We propose Probabilistic Warp Consistency, a weakly-supervised learning objective for semantic matching.

Weakly-supervised Learning

618

Paper
Code

Adiabatic Quantum Computing for Multi Object Tracking

no code implementations • CVPR 2022 • Jan-Nico Zaech, Alexander Liniger, Martin Danelljan, Dengxin Dai, Luc van Gool

Multi-Object Tracking (MOT) is most often approached in the tracking-by-detection paradigm, where object detections are associated through time.

Multi-Object Tracking Object

Paper
Add Code

Fast Online Video Super-Resolution with Deformable Attention Pyramid

no code implementations • 3 Feb 2022 • Dario Fuoli, Martin Danelljan, Radu Timofte, Luc van Gool

Our DAP aligns and integrates information from the recurrent state into the current frame prediction.

Video Super-Resolution

Paper
Add Code

RePaint: Inpainting using Denoising Diffusion Probabilistic Models

3 code implementations • CVPR 2022 • Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, Luc van Gool

In this work, we propose RePaint: A Denoising Diffusion Probabilistic Model (DDPM) based inpainting approach that is applicable to even extreme masks.

Denoising Image Inpainting

10,821

Paper
Code

Collapse by Conditioning: Training Class-conditional GANs with Limited Data

1 code implementation • ICLR 2022 • Mohamad Shahbazi, Martin Danelljan, Danda Pani Paudel, Luc van Gool

On the contrary, we observe that class-conditioning causes mode collapse in limited data settings, where unconditional learning leads to satisfactory generative ability.

Generative Adversarial Network

Paper
Code

Efficient Visual Tracking with Exemplar Transformers

2 code implementations • 17 Dec 2021 • Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc van Gool

E. T. Track, our visual tracker that incorporates Exemplar Transformer modules, runs at 47 FPS on a CPU.

Visual Object Tracking Visual Tracking

3,086

Paper
Code

Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey and Outlook

no code implementations • 6 Dec 2021 • Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, Jiri Matas

Accurate and robust visual object tracking is one of the most challenging and fundamental computer vision problems.

Visual Object Tracking Visual Tracking

Paper
Add Code

Mask Transfiner for High-Quality Instance Segmentation

2 code implementations • CVPR 2022 • Lei Ke, Martin Danelljan, Xia Li, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

Instead of operating on regular dense tensors, our Mask Transfiner decomposes and represents the image regions as a quadtree.

Ranked #1 on Instance Segmentation on BDD100K val

Instance Segmentation Segmentation +2

520

Paper
Code

Normalizing Flow as a Flexible Fidelity Objective for Photo-Realistic Super-resolution

no code implementations • 5 Nov 2021 • Andreas Lugmayr, Martin Danelljan, Fisher Yu, Luc van Gool, Radu Timofte

Super-resolution is an ill-posed problem, where a ground-truth high-resolution image represents only one possibility in the space of plausible solutions.

Super-Resolution

Paper
Add Code

Learning Proposals for Practical Energy-Based Regression

1 code implementation • 22 Oct 2021 • Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression.

regression

Paper
Code

Dense Gaussian Processes for Few-Shot Segmentation

1 code implementation • 7 Oct 2021 • Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, Martin Danelljan

Given the support set, our dense GP learns the mapping from local deep image features to mask values, capable of capturing complex appearance distributions.

Ranked #1 on Few-Shot Semantic Segmentation on COCO-20i (10-shot)

Few-Shot Semantic Segmentation Gaussian Processes +1

Paper
Code

PDC-Net+: Enhanced Probabilistic Dense Correspondence Network

1 code implementation • 28 Sep 2021 • Prune Truong, Martin Danelljan, Radu Timofte, Luc van Gool

In order to apply dense methods to real-world applications, such as pose estimation, image manipulation, or 3D reconstruction, it is therefore crucial to estimate the confidence of the predicted matches.

3D Reconstruction Geometric Matching +6

618

Paper
Code

TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation

1 code implementation • 10 Sep 2021 • Rui Gong, Martin Danelljan, Dengxin Dai, Danda Pani Paudel, Ajad Chhatkuli, Fisher Yu, Luc van Gool

In many real-world settings, the target domain task requires a different taxonomy than the one imposed by the source domain.

Contrastive Learning Domain Adaptation +1

Paper
Code

Deep Reparametrization of Multi-Frame Super-Resolution and Denoising

2 code implementations • ICCV 2021 • Goutam Bhat, Martin Danelljan, Fisher Yu, Luc van Gool, Radu Timofte

The deep reparametrization allows us to directly model the image formation process in the latent space, and to integrate learned image priors into the prediction.

Ranked #4 on Burst Image Super-Resolution on BurstSR

Burst Image Super-Resolution Denoising +2

169

Paper
Code

Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling

1 code implementation • ICCV 2021 • Jingyun Liang, Andreas Lugmayr, Kai Zhang, Martin Danelljan, Luc van Gool, Radu Timofte

More specifically, HCFlow learns a bijective mapping between HR and LR image pairs by modelling the distribution of the LR image and the rest high-frequency component simultaneously.

Ranked #25 on Video Super-Resolution on MSU Video Super Resolution Benchmark: Detail Restoration

Image Super-Resolution Video Super-Resolution

186

Paper
Code

Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation

1 code implementation • NeurIPS 2021 • Lei Ke, Xia Li, Martin Danelljan, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

We propose Prototypical Cross-Attention Network (PCAN), capable of leveraging rich spatio-temporal information for online multiple object tracking and segmentation.

Ranked #1 on Video Instance Segmentation on BDD100K val

Multi-Object Tracking and Segmentation Multiple Object Track and Segmentation +3

359

Paper
Code

NTIRE 2021 Challenge on Burst Super-Resolution: Methods and Results

no code implementations • 7 Jun 2021 • Goutam Bhat, Martin Danelljan, Radu Timofte, Kazutoshi Akita, Wooyeong Cho, Haoqiang Fan, Lanpeng Jia, Daeshik Kim, Bruno Lecouat, Youwei Li, Shuaicheng Liu, Ziluan Liu, Ziwei Luo, Takahiro Maeda, Julien Mairal, Christian Micheloni, Xuan Mo, Takeru Oba, Pavel Ostyakov, Jean Ponce, Sanghyeok Son, Jian Sun, Norimichi Ukita, Rao Muhammad Umer, Youliang Yan, Lei Yu, Magauiya Zhussip, Xueyi Zou

This paper reviews the NTIRE2021 challenge on burst super-resolution.

Super-Resolution

Paper
Add Code

Learnable Online Graph Representations for 3D Multi-Object Tracking

no code implementations • 23 Apr 2021 • Jan-Nico Zaech, Dengxin Dai, Alexander Liniger, Martin Danelljan, Luc van Gool

Tracking of objects in 3D is a fundamental task in computer vision that finds use in a wide range of applications such as autonomous driving, robotics or augmented reality.

3D Multi-Object Tracking Autonomous Driving

Paper
Add Code

Warp Consistency for Unsupervised Learning of Dense Correspondences

1 code implementation • ICCV 2021 • Prune Truong, Martin Danelljan, Fisher Yu, Luc van Gool

From our observations and empirical results, we design a general unsupervised objective employing two of the derived constraints.

Dense Pixel Correspondence Estimation

618

Paper
Code

Deep Gaussian Processes for Few-Shot Segmentation

no code implementations • 30 Mar 2021 • Joakim Johnander, Johan Edstedt, Martin Danelljan, Michael Felsberg, Fahad Shahbaz Khan

Through the expressivity of the GP, our approach is capable of modeling complex appearance distributions in the deep feature space.

Gaussian Processes Segmentation

Paper
Add Code

Learning Target Candidate Association to Keep Track of What Not to Track

1 code implementation • ICCV 2021 • Christoph Mayer, Martin Danelljan, Danda Pani Paudel, Luc van Gool

To tackle the problem of lacking ground-truth correspondences between distractor objects in visual tracking, we propose a training strategy that combines partial annotations with self-supervision.

Ranked #4 on Visual Object Tracking on OTB-2015

Visual Object Tracking Visual Tracking

3,086

Paper
Code

Deep Burst Super-Resolution

3 code implementations • CVPR 2021 • Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte

We propose a novel architecture for the burst super-resolution task.

Ranked #6 on Burst Image Super-Resolution on SyntheticBurst

Burst Image Super-Resolution Multi-Frame Super-Resolution +1

169

Paper
Code

DeFlow: Learning Complex Image Degradations from Unpaired Data with Conditional Flows

2 code implementations • CVPR 2021 • Valentin Wolf, Andreas Lugmayr, Martin Danelljan, Luc van Gool, Radu Timofte

We propose DeFlow, a method for learning stochastic image degradations from unpaired data.

Image Restoration Super-Resolution

Paper
Code

Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos

1 code implementation • ICCV 2021 • Bin Zhao, Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte

This effectively limits the performance and generalization capabilities of existing video segmentation methods.

Object Segmentation +4

3,086

Paper
Code

Local Memory Attention for Fast Video Semantic Segmentation

1 code implementation • 5 Jan 2021 • Matthieu Paul, Martin Danelljan, Luc van Gool, Radu Timofte

Our approach aggregates a rich representation of the semantic information in past frames into a memory module.

Segmentation Semantic Segmentation +1

Paper
Code

Learning Accurate Dense Correspondences and When to Trust Them

4 code implementations • CVPR 2021 • Prune Truong, Martin Danelljan, Luc van Gool, Radu Timofte

Establishing dense correspondences between a pair of images is an important and general problem.

3D Reconstruction Dense Pixel Correspondence Estimation +4

619

Paper
Code

Scaling Semantic Segmentation Beyond 1K Classes on a Single GPU

1 code implementation • ICCV 2021 • Shipra Jain, Danda Paudel Pani, Martin Danelljan, Luc van Gool

In this paper, we propose a novel training methodology to train and scale the existing semantic segmentation models for a large number of semantic classes without increasing the memory overhead.

Image Classification object-detection +3

Paper
Code

Accurate 3D Object Detection using Energy-Based Models

1 code implementation • 8 Dec 2020 • Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

On the KITTI dataset, our proposed approach consistently outperforms the SA-SSD baseline across all 3DOD metrics, demonstrating the potential of EBM-based regression for highly accurate 3DOD.

Ranked #1 on 3D Object Detection on KITTI Cars Easy val

3D Object Detection Object +2

Paper
Code

Learning Video Instance Segmentation with Recurrent Graph Neural Networks

no code implementations • 7 Dec 2020 • Joakim Johnander, Emil Brissman, Martin Danelljan, Michael Felsberg

Most existing approaches to video instance segmentation comprise multiple modules that are heuristically combined to produce the final output.

Instance Segmentation Management +3

Paper
Add Code

Fast Few-Shot Classification by Few-Iteration Meta-Learning

1 code implementation • 1 Oct 2020 • Ardhendu Shekhar Tripathi, Martin Danelljan, Luc van Gool, Radu Timofte

By employing an efficient initialization module and a Steepest Descent based optimization algorithm, our base learner predicts a powerful classifier within only a few iterations.

Classification General Classification +3

Paper
Code

GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network

2 code implementations • NeurIPS 2020 • Prune Truong, Martin Danelljan, Luc van Gool, Radu Timofte

We propose GOCor, a fully differentiable dense matching module, acting as a direct replacement to the feature correlation layer.

Dense Pixel Correspondence Estimation Feature Correlation +2

219

Paper
Code

AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

3 code implementations • 15 Sep 2020 • Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, JungHeum Kang, Sung-Ho Bae, Yongwoo Kim, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Eric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P. S, Densen Puthussery, Jiji C. V, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Jiangtao Zhang, Xiaotong Luo, Liang Chen, Yanyun Qu, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni

This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results.

Image Super-Resolution

2,713

Paper
Code

DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation

1 code implementation • NeurIPS 2020 • Alexandre Carlier, Martin Danelljan, Alexandre Alahi, Radu Timofte

Scalable Vector Graphics (SVG) are ubiquitous in modern 2D interfaces due to their ability to scale to different resolutions.

Ranked #1 on Vector Graphics Animation on SVG-Icons8

Representation Learning Vector Graphics +1

876

Paper
Code

Video Object Segmentation with Episodic Graph Memory Networks

1 code implementation • ECCV 2020 • Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, Luc van Gool

How to make a segmentation model efficiently adapt to a specific video and to online target appearance variations are fundamentally crucial issues in the field of video object segmentation.

Object Segmentation +4

Paper
Code

The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures

1 code implementation • CVPR 2021 • Yawei Li, Wen Li, Martin Danelljan, Kai Zhang, Shuhang Gu, Luc van Gool, Radu Timofte

Based on that, we articulate the heterogeneity hypothesis: with the same training protocol, there exists a layer-wise differentiated network architecture (LW-DNA) that can outperform the original network with regular channel configurations but with a lower level of model complexity.

Image Classification Image Restoration +1

Paper
Code

SRFlow: Learning the Super-Resolution Space with Normalizing Flow

7 code implementations • ECCV 2020 • Andreas Lugmayr, Martin Danelljan, Luc van Gool, Radu Timofte

SRFlow therefore directly accounts for the ill-posed nature of the problem, and learns to predict diverse photo-realistic high-resolution images.

Ranked #6 on Image Super-Resolution on DIV2K val - 4x upscaling (using extra training data)

Image Manipulation Image Super-Resolution

821

Paper
Code

NTIRE 2020 Challenge on Video Quality Mapping: Methods and Results

no code implementations • 5 May 2020 • Dario Fuoli, Zhiwu Huang, Martin Danelljan, Radu Timofte, Hua Wang, Longcun Jin, Dewei Su, Jing Liu, Jaehoon Lee, Michal Kudelski, Lukasz Bala, Dmitry Hrybov, Marcin Mozejko, Muchen Li, Si-Yao Li, Bo Pang, Cewu Lu, Chao Li, Dongliang He, Fu Li, Shilei Wen

For track 2, some existing methods are evaluated, showing promising solutions to the weakly-supervised video quality mapping problem.

Paper
Add Code

NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results

5 code implementations • 5 May 2020 • Andreas Lugmayr, Martin Danelljan, Radu Timofte, Namhyuk Ahn, Dongwoon Bai, Jie Cai, Yun Cao, Junyang Chen, Kaihua Cheng, SeYoung Chun, Wei Deng, Mostafa El-Khamy, Chiu Man Ho, Xiaozhong Ji, Amin Kheradmand, Gwantae Kim, Hanseok Ko, Kanghyu Lee, Jungwon Lee, Hao Li, Ziluan Liu, Zhi-Song Liu, Shuai Liu, Yunhua Lu, Zibo Meng, Pablo Navarrete Michelini, Christian Micheloni, Kalpesh Prajapati, Haoyu Ren, Yong Hyeok Seo, Wan-Chi Siu, Kyung-Ah Sohn, Ying Tai, Rao Muhammad Umer, Shuangquan Wang, Huibing Wang, Timothy Haoning Wu, Hao-Ning Wu, Biao Yang, Fuzhi Yang, Jaejun Yoo, Tongtong Zhao, Yuanbo Zhou, Haijie Zhuo, Ziyao Zong, Xueyi Zou

This paper reviews the NTIRE 2020 challenge on real world super-resolution.

Benchmarking Image Super-Resolution

1,049

Paper
Code

How to Train Your Energy-Based Model for Regression

1 code implementation • 4 May 2020 • Fredrik K. Gustafsson, Martin Danelljan, Radu Timofte, Thomas B. Schön

While they are commonly employed for generative image modeling, recent work has applied EBMs also for regression tasks, achieving state-of-the-art performance on object detection and visual tracking.

Ranked #1 on Visual Object Tracking on OTB-100

object-detection Object Detection +3

Paper
Code

Learning Human-Object Interaction Detection using Interaction Points

1 code implementation • CVPR 2020 • Tiancai Wang, Tong Yang, Martin Danelljan, Fahad Shahbaz Khan, Xiangyu Zhang, Jian Sun

Human-object interaction (HOI) detection strives to localize both the human and an object as well as the identification of complex interactions between them.

Human-Object Interaction Detection Keypoint Detection +2

Paper
Code

Probabilistic Regression for Visual Tracking

2 code implementations • CVPR 2020 • Martin Danelljan, Luc van Gool, Radu Timofte

In this work, we therefore propose a probabilistic regression formulation and apply it to tracking.

Ranked #4 on Object Tracking on FE108

Object Tracking regression +1

3,375

Paper
Code

Learning What to Learn for Video Object Segmentation

2 code implementations • ECCV 2020 • Goutam Bhat, Felix Järemo Lawin, Martin Danelljan, Andreas Robinson, Michael Felsberg, Luc van Gool, Radu Timofte

This allows us to achieve a rich internal representation of the target in the current frame, significantly increasing the segmentation accuracy of our approach.

Ranked #13 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

Few-Shot Learning Object +4

3,086

Paper
Code

Know Your Surroundings: Exploiting Scene Information for Object Tracking

1 code implementation • ECCV 2020 • Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte

Such approaches are however prone to fail in case of e. g. fast appearance changes or presence of distractor objects, where a target appearance model alone is insufficient for robust tracking.

Ranked #8 on Object Tracking on FE108

Object Tracking

3,086

Paper
Code

Learning Fast and Robust Target Models for Video Object Segmentation

2 code implementations • CVPR 2020 • Andreas Robinson, Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg

The target appearance model consists of a light-weight module, which is learned during the inference stage using fast optimization techniques to predict a coarse but robust target segmentation.

Ranked #20 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

One-shot visual object segmentation Segmentation +2

123

Paper
Code

GLU-Net: Global-Local Universal Network for Dense Flow and Correspondences

2 code implementations • CVPR 2020 • Prune Truong, Martin Danelljan, Radu Timofte

Establishing dense correspondences between a pair of images is an important and general problem, covering geometric matching, optical flow and semantic correspondences.

Dense Pixel Correspondence Estimation Geometric Matching +1

219

Paper
Code

AIM 2019 Challenge on Real-World Image Super-Resolution: Methods and Results

1 code implementation • 18 Nov 2019 • Andreas Lugmayr, Martin Danelljan, Radu Timofte, Manuel Fritsche, Shuhang Gu, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan, Nam Hyung Joon, Yu Seung Won, Guisik Kim, Dokyeong Kwon, Chih-Chung Hsu, Chia-Hsiang Lin, Yuanfei Huang, Xiaopeng Sun, Wen Lu, Jie Li, Xinbo Gao, Sefi Bell-Kligler

For training, only one set of source input images is therefore provided in the challenge.

Image Super-Resolution

158

Paper
Code

Energy-Based Models for Deep Probabilistic Regression

1 code implementation • ECCV 2020 • Fredrik K. Gustafsson, Martin Danelljan, Goutam Bhat, Thomas B. Schön

In our proposed approach, we create an energy-based model of the conditional target density p(y|x), using a deep neural network to predict the un-normalized density from (x, y).

Ranked #1 on Object Detection on COCO test-dev (Hardware Burden metric)

Head Pose Estimation object-detection +4

Paper
Code

Unsupervised Learning for Real-World Super-Resolution

no code implementations • 20 Sep 2019 • Andreas Lugmayr, Martin Danelljan, Radu Timofte

Instead of directly addressing this problem, most works employ the popular bicubic downsampling strategy to artificially generate a corresponding low resolution image.

Image Super-Resolution

Paper
Add Code

Multi-Modal Fusion for End-to-End RGB-T Tracking

1 code implementation • 30 Aug 2019 • Lichao Zhang, Martin Danelljan, Abel Gonzalez-Garcia, Joost Van de Weijer, Fahad Shahbaz Khan

Our tracker is trained in an end-to-end manner, enabling the components to learn how to fuse the information from both modalities.

Ranked #7 on Rgb-T Tracking on RGBT210

Image-to-Image Translation Rgb-T Tracking

Paper
Code

Learning the Model Update for Siamese Trackers

1 code implementation • ICCV 2019 • Lichao Zhang, Abel Gonzalez-Garcia, Joost Van de Weijer, Martin Danelljan, Fahad Shahbaz Khan

In general, this template is linearly combined with the accumulated template from the previous frame, resulting in an exponential decay of information over time.

Visual Tracking

134

Paper
Code

Evaluating Scalable Bayesian Deep Learning Methods for Robust Computer Vision

1 code implementation • 4 Jun 2019 • Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

We therefore accept this task and propose a comprehensive evaluation framework for scalable epistemic uncertainty estimation methods in deep learning.

Depth Completion Semantic Segmentation

128

Paper
Code

Discriminative Online Learning for Fast Video Object Segmentation

no code implementations • 18 Apr 2019 • Andreas Robinson, Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg

We propose a novel approach, based on a dedicated target appearance model that is exclusively learned online to discriminate between the target and background image regions.

Object One-shot visual object segmentation +4

Paper
Add Code

Learning Discriminative Model Prediction for Tracking

2 code implementations • ICCV 2019 • Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte

The current strive towards end-to-end trainable computer vision systems imposes major challenges for the task of visual tracking.

Ranked #5 on Object Tracking on FE108

Visual Object Tracking Visual Tracking

3,086

Paper
Code

A Generative Appearance Model for End-to-end Video Object Segmentation

1 code implementation • CVPR 2019 • Joakim Johnander, Martin Danelljan, Emil Brissman, Fahad Shahbaz Khan, Michael Felsberg

One of the fundamental challenges in video object segmentation is to find an effective representation of the target and background appearance.

Ranked #52 on Semi-Supervised Video Object Segmentation on DAVIS 2017 (test-dev)

One-shot visual object segmentation Segmentation +2

Paper
Code

ATOM: Accurate Tracking by Overlap Maximization

3 code implementations • CVPR 2019 • Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg

We argue that this approach is fundamentally limited since target estimation is a complex task, requiring high-level knowledge about the object.

Ranked #7 on Object Tracking on FE108

General Classification Visual Object Tracking +1

3,086

Paper
Code

Synthetic data generation for end-to-end thermal infrared tracking

no code implementations • 4 Jun 2018 • Lichao Zhang, Abel Gonzalez-Garcia, Joost Van de Weijer, Martin Danelljan, Fahad Shahbaz Khan

These methods provide us with a large labeled dataset of synthetic TIR sequences, on which we can train end-to-end optimal features for tracking.

Image-to-Image Translation Synthetic Data Generation +2

Paper
Add Code

Unveiling the Power of Deep Tracking

no code implementations • ECCV 2018 • Goutam Bhat, Joakim Johnander, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg

In the field of generic object tracking numerous attempts have been made to exploit deep features.

Object Tracking

Paper
Add Code

Density Adaptive Point Set Registration

1 code implementation • CVPR 2018 • Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Per-Erik Forssén, Michael Felsberg

Contrary to previous works, we model the underlying structure of the scene as a latent probability distribution, and thereby induce invariance to point set density changes.

Paper
Code

DCCO: Towards Deformable Continuous Convolution Operators

no code implementations • 9 Jun 2017 • Joakim Johnander, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg

Generally, DCF based trackers learn a rigid appearance model of the target.

Paper
Add Code

Deep Projective 3D Semantic Segmentation

1 code implementation • 9 May 2017 • Felix Järemo Lawin, Martin Danelljan, Patrik Tosteberg, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg

Recent attempts, based on 3D deep learning approaches (3D-CNNs), have achieved below-expected results.

Ranked #15 on Semantic Segmentation on Semantic3D

Segmentation

387

Paper
Code

Deep Motion Features for Visual Tracking

no code implementations • 20 Dec 2016 • Susanna Gladh, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg

To the best of our knowledge, we are the first to propose fusing appearance information with deep motion features for visual tracking.

Action Recognition Optical Flow Estimation +3

Paper
Add Code

ECO: Efficient Convolution Operators for Tracking

5 code implementations • CVPR 2017 • Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg

Moreover, our fast variant, using hand-crafted features, operates at 60 Hz on a single CPU, while obtaining 65. 0% AUC on OTB-2015.

Ranked #13 on Visual Object Tracking on VOT2017/18

Visual Object Tracking

610

Paper
Code

Discriminative Scale Space Tracking

no code implementations • 20 Sep 2016 • Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg

Compared to the standard exhaustive scale search, our approach achieves a gain of 2. 5% in average overlap precision on the OTB dataset.

Visual Object Tracking

Paper
Add Code

Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative Visual Tracking

no code implementations • CVPR 2016 • Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg

We propose a novel generic approach for alleviating the problem of corrupted training samples in tracking-by-detection frameworks.

Visual Tracking

Paper
Add Code

Learning Spatially Regularized Correlation Filters for Visual Tracking

no code implementations • ICCV 2015 • Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg

These methods utilize a periodic assumption of the training samples to efficiently learn a classifier on all patches in the target neighborhood.

Visual Tracking

Paper
Add Code

Beyond Correlation Filters: Learning Continuous Convolution Operators for Visual Tracking

1 code implementation • 12 Aug 2016 • Martin Danelljan, Andreas Robinson, Fahad Shahbaz Khan, Michael Felsberg

We also demonstrate the effectiveness of our learning formulation in extensive feature point tracking experiments.

Point Tracking Visual Object Tracking

193

Paper
Code

A Probabilistic Framework for Color-Based Point Set Registration

no code implementations • CVPR 2016 • Martin Danelljan, Giulia Meneghetti, Fahad Shahbaz Khan, Michael Felsberg

On the Stanford Lounge dataset, our approach achieves a relative reduction of the failure rate by 78% compared to the baseline.

Paper
Add Code

Adaptive Color Attributes for Real-Time Visual Tracking

no code implementations • CVPR 2014 • Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg, Joost Van de Weijer

This paper investigates the contribution of color in a tracking-by-detection framework.

Attribute Object Recognition +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.