Search Results for author: Anna Khoreva

Found 25 papers, 14 papers with code

VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis

1 code implementation • 20 Mar 2024 • Yumeng Li, William Beluch, Margret Keuper, Dan Zhang, Anna Khoreva

Despite tremendous progress in the field of text-to-video (T2V) synthesis, open-sourced T2V diffusion models struggle to generate longer videos with dynamically varying and evolving content.

Generative Temporal Nursing Text-to-Video Generation +1

Paper
Code

Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive

1 code implementation • 16 Jan 2024 • Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva

Current L2I models either suffer from poor editability via text or weak alignment between the generated image and the input layout.

Domain Generalization Layout-to-Image Generation +1

Paper
Code

Divide & Bind Your Attention for Improved Generative Semantic Nursing

1 code implementation • 20 Jul 2023 • Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva

To address the challenges posed by complex prompts or scenarios involving multiple entities and to achieve improved attribute binding, we propose Divide & Bind.

Attribute Generative Semantic Nursing +1

Paper
Code

Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization

1 code implementation • 2 Jul 2023 • Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva

Using the proposed masked noise encoder to randomize style and content combinations in the training set, i. e., intra-source style augmentation (ISSA) effectively increases the diversity of training data and reduces spurious correlation.

Autonomous Driving Data Augmentation +3

Paper
Code

Discovering Class-Specific GAN Controls for Semantic Image Synthesis

no code implementations • 2 Dec 2022 • Edgar Schönfeld, Julio Borges, Vadim Sushko, Bernt Schiele, Anna Khoreva

Prior work has extensively studied the latent space structure of GANs for unconditional image synthesis, enabling global editing of generated images by the unsupervised discovery of interpretable latent directions.

Image Generation

Paper
Add Code

Intra-Source Style Augmentation for Improved Domain Generalization

1 code implementation • 18 Oct 2022 • Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva

Using the proposed masked noise encoder to randomize style and content combinations in the training set, ISSA effectively increases the diversity of training data and reduces spurious correlation.

Autonomous Driving Domain Generalization +1

Paper
Code

One-Shot Synthesis of Images and Segmentation Masks

1 code implementation • 15 Sep 2022 • Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva

To this end, inspired by the recent architectural developments of single-image GANs, we introduce our OSMIS model which enables the synthesis of segmentation masks that are precisely aligned to the generated images in the one-shot regime.

Data Augmentation Image Generation +2

Paper
Code

Learning to Generate Novel Scene Compositions from Single Images and Videos

1 code implementation • 12 May 2021 • Vadim Sushko, Juergen Gall, Anna Khoreva

Training GANs in low-data regimes remains a challenge, as overfitting often leads to memorization or training divergence.

Memorization

Paper
Code

Generating Novel Scene Compositions from Single Images and Videos

1 code implementation • 24 Mar 2021 • Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva

In this work, we introduce SIV-GAN, an unconditional generative model that can generate new scene compositions from a single training image or a single video clip.

Image Generation Memorization

Paper
Code

You Only Need Adversarial Supervision for Semantic Image Synthesis

1 code implementation • ICLR 2021 • Vadim Sushko, Edgar Schönfeld, Dan Zhang, Juergen Gall, Bernt Schiele, Anna Khoreva

By providing stronger supervision to the discriminator as well as to the generator through spatially- and semantically-aware discriminator feedback, we are able to synthesize images of higher fidelity with better alignment to their input label maps, making the use of the perceptual loss superfluous.

Ranked #1 on Image-to-Image Translation on ADE20K-Outdoor Labels-to-Photos

Image-to-Image Translation Semantic Segmentation

315

Paper
Code

Improving Augmentation and Evaluation Schemes for Semantic Image Synthesis

no code implementations • 25 Nov 2020 • Prateek Katiyar, Anna Khoreva

We therefore propose to improve the established semantic image synthesis evaluation scheme by analyzing separately the performance of generated images on the biased and unbiased classes for the given segmentation network.

Ranked #5 on Image-to-Image Translation on ADE20K Labels-to-Photos (Accuracy metric)

Benchmarking Data Augmentation +1

Paper
Add Code

A U-Net Based Discriminator for Generative Adversarial Networks

3 code implementations • 28 Feb 2020 • Edgar Schönfeld, Bernt Schiele, Anna Khoreva

The novel discriminator improves over the state of the art in terms of the standard distribution and image quality metrics, enabling the generator to synthesize images with varying structure, appearance and levels of detail, maintaining global and local realism.

Ranked #1 on Image Generation on CelebA 128x128

Conditional Image Generation Data Augmentation

364

Paper
Code

Grid Saliency for Context Explanations of Semantic Segmentation

2 code implementations • NeurIPS 2019 • Lukas Hoyer, Mauricio Munoz, Prateek Katiyar, Anna Khoreva, Volker Fischer

Recently, there has been a growing interest in developing saliency methods that provide visual explanations of network predictions.

Image Classification Segmentation +1

Paper
Code

Short-Term Prediction and Multi-Camera Fusion on Semantic Grids

no code implementations • 21 Mar 2019 • Lukas Hoyer, Patrick Kesper, Anna Khoreva, Volker Fischer

An environment representation (ER) is a substantial part of every autonomous system.

Decision Making Semantic Segmentation +1

Paper
Add Code

Progressive Augmentation of GANs

1 code implementation • NeurIPS 2019 • Dan Zhang, Anna Khoreva

Training of Generative Adversarial Networks (GANs) is notoriously fragile, requiring to maintain a careful balance between the generator and the discriminator in order to perform well.

Ranked #3 on Image Generation on CelebA-HQ 128x128

Image Generation

Paper
Code

PA-GAN: Improving GAN Training by Progressive Augmentation

no code implementations • 27 Sep 2018 • Dan Zhang, Anna Khoreva

Despite recent progress, Generative Adversarial Networks (GANs) still suffer from training instability, requiring careful consideration of architecture design choices and hyper-parameter tuning.

Image Generation

Paper
Add Code

Learning to Refine Human Pose Estimation

no code implementations • 21 Apr 2018 • Mihai Fieraru, Anna Khoreva, Leonid Pishchulin, Bernt Schiele

Multi-person pose estimation in images and videos is an important yet challenging task with many applications.

Ranked #1 on Multi-Person Pose Estimation and Tracking on PoseTrack2018

Data Augmentation Keypoint Detection +3

Paper
Add Code

Video Object Segmentation with Language Referring Expressions

no code implementations • 21 Mar 2018 • Anna Khoreva, Anna Rohrbach, Bernt Schiele

We show that our language-supervised approach performs on par with the methods which have access to a pixel-level mask of the target object on DAVIS'16 and is competitive to methods using scribbles on the challenging DAVIS'17 dataset.

Ranked #1 on Video Object Segmentation on DAVIS 2017 (mIoU metric)

Object Referring Expression Segmentation +4

Paper
Add Code

Lucid Data Dreaming for Video Object Segmentation

4 code implementations • 28 Mar 2017 • Anna Khoreva, Rodrigo Benenson, Eddy Ilg, Thomas Brox, Bernt Schiele

Our approach is suitable for both single and multiple object segmentation.

Ranked #43 on Semi-Supervised Video Object Segmentation on DAVIS 2017 (test-dev)

Multiple Object Tracking Object +4

125

Paper
Code

Exploiting saliency for object segmentation from image level labels

no code implementations • CVPR 2017 • Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, Mario Fritz, Bernt Schiele

We show how to combine both information sources in order to recover 80% of the fully supervised performance - which is the new state of the art in weakly supervised training for pixel-wise semantic labelling.

Ranked #26 on Semantic Segmentation on PASCAL VOC 2012 val

Object Semantic Segmentation

Paper
Add Code

Learning Video Object Segmentation from Static Images

2 code implementations • CVPR 2017 • Anna Khoreva, Federico Perazzi, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung

Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce video object segmentation problem as a concept of guided instance segmentation.

Ranked #6 on Semi-Supervised Video Object Segmentation on YouTube