Search Results for author: Anna Khoreva

Found 25 papers, 14 papers with code

VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis

1 code implementation20 Mar 2024 Yumeng Li, William Beluch, Margret Keuper, Dan Zhang, Anna Khoreva

Despite tremendous progress in the field of text-to-video (T2V) synthesis, open-sourced T2V diffusion models struggle to generate longer videos with dynamically varying and evolving content.

Generative Temporal Nursing Text-to-Video Generation +1

Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive

1 code implementation16 Jan 2024 Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva

Current L2I models either suffer from poor editability via text or weak alignment between the generated image and the input layout.

Domain Generalization Layout-to-Image Generation +1

Divide & Bind Your Attention for Improved Generative Semantic Nursing

1 code implementation20 Jul 2023 Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva

To address the challenges posed by complex prompts or scenarios involving multiple entities and to achieve improved attribute binding, we propose Divide & Bind.

Attribute Generative Semantic Nursing +1

Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization

1 code implementation2 Jul 2023 Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva

Using the proposed masked noise encoder to randomize style and content combinations in the training set, i. e., intra-source style augmentation (ISSA) effectively increases the diversity of training data and reduces spurious correlation.

Autonomous Driving Data Augmentation +3

Discovering Class-Specific GAN Controls for Semantic Image Synthesis

no code implementations2 Dec 2022 Edgar Schönfeld, Julio Borges, Vadim Sushko, Bernt Schiele, Anna Khoreva

Prior work has extensively studied the latent space structure of GANs for unconditional image synthesis, enabling global editing of generated images by the unsupervised discovery of interpretable latent directions.

Image Generation

Intra-Source Style Augmentation for Improved Domain Generalization

1 code implementation18 Oct 2022 Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva

Using the proposed masked noise encoder to randomize style and content combinations in the training set, ISSA effectively increases the diversity of training data and reduces spurious correlation.

Autonomous Driving Domain Generalization +1

One-Shot Synthesis of Images and Segmentation Masks

1 code implementation15 Sep 2022 Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva

To this end, inspired by the recent architectural developments of single-image GANs, we introduce our OSMIS model which enables the synthesis of segmentation masks that are precisely aligned to the generated images in the one-shot regime.

Data Augmentation Image Generation +2

Learning to Generate Novel Scene Compositions from Single Images and Videos

1 code implementation12 May 2021 Vadim Sushko, Juergen Gall, Anna Khoreva

Training GANs in low-data regimes remains a challenge, as overfitting often leads to memorization or training divergence.

Memorization

Generating Novel Scene Compositions from Single Images and Videos

1 code implementation24 Mar 2021 Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva

In this work, we introduce SIV-GAN, an unconditional generative model that can generate new scene compositions from a single training image or a single video clip.

Image Generation Memorization

You Only Need Adversarial Supervision for Semantic Image Synthesis

1 code implementation ICLR 2021 Vadim Sushko, Edgar Schönfeld, Dan Zhang, Juergen Gall, Bernt Schiele, Anna Khoreva

By providing stronger supervision to the discriminator as well as to the generator through spatially- and semantically-aware discriminator feedback, we are able to synthesize images of higher fidelity with better alignment to their input label maps, making the use of the perceptual loss superfluous.

Image-to-Image Translation Semantic Segmentation

Improving Augmentation and Evaluation Schemes for Semantic Image Synthesis

no code implementations25 Nov 2020 Prateek Katiyar, Anna Khoreva

We therefore propose to improve the established semantic image synthesis evaluation scheme by analyzing separately the performance of generated images on the biased and unbiased classes for the given segmentation network.

Benchmarking Data Augmentation +1

A U-Net Based Discriminator for Generative Adversarial Networks

3 code implementations28 Feb 2020 Edgar Schönfeld, Bernt Schiele, Anna Khoreva

The novel discriminator improves over the state of the art in terms of the standard distribution and image quality metrics, enabling the generator to synthesize images with varying structure, appearance and levels of detail, maintaining global and local realism.

Conditional Image Generation Data Augmentation

Grid Saliency for Context Explanations of Semantic Segmentation

2 code implementations NeurIPS 2019 Lukas Hoyer, Mauricio Munoz, Prateek Katiyar, Anna Khoreva, Volker Fischer

Recently, there has been a growing interest in developing saliency methods that provide visual explanations of network predictions.

Image Classification Segmentation +1

Progressive Augmentation of GANs

1 code implementation NeurIPS 2019 Dan Zhang, Anna Khoreva

Training of Generative Adversarial Networks (GANs) is notoriously fragile, requiring to maintain a careful balance between the generator and the discriminator in order to perform well.

Image Generation

PA-GAN: Improving GAN Training by Progressive Augmentation

no code implementations27 Sep 2018 Dan Zhang, Anna Khoreva

Despite recent progress, Generative Adversarial Networks (GANs) still suffer from training instability, requiring careful consideration of architecture design choices and hyper-parameter tuning.

Image Generation

Video Object Segmentation with Language Referring Expressions

no code implementations21 Mar 2018 Anna Khoreva, Anna Rohrbach, Bernt Schiele

We show that our language-supervised approach performs on par with the methods which have access to a pixel-level mask of the target object on DAVIS'16 and is competitive to methods using scribbles on the challenging DAVIS'17 dataset.

 Ranked #1 on Video Object Segmentation on DAVIS 2017 (mIoU metric)

Object Referring Expression Segmentation +4

Exploiting saliency for object segmentation from image level labels

no code implementations CVPR 2017 Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, Mario Fritz, Bernt Schiele

We show how to combine both information sources in order to recover 80% of the fully supervised performance - which is the new state of the art in weakly supervised training for pixel-wise semantic labelling.

Object Semantic Segmentation

Learning Video Object Segmentation from Static Images

2 code implementations CVPR 2017 Anna Khoreva, Federico Perazzi, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung

Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce video object segmentation problem as a concept of guided instance segmentation.

Instance Segmentation Object +5

Cannot find the paper you are looking for? You can Submit a new open access paper.