Search Results for author: Alexey A. Gritsenko

Found 8 papers, 2 papers with code

Improving fine-grained understanding in image-text pre-training

no code implementations • 18 Jan 2024 • Ioana Bica, Anastasija Ilić, Matthias Bauer, Goker Erdogan, Matko Bošnjak, Christos Kaplanis, Alexey A. Gritsenko, Matthias Minderer, Charles Blundell, Razvan Pascanu, Jovana Mitrović

We introduce SPARse Fine-grained Contrastive Alignment (SPARC), a simple method for pretraining more fine-grained multimodal representations from image-text pairs.

object-detection Object Detection

Paper
Add Code

Autoregressive Diffusion Models

2 code implementations • ICLR 2022 • Emiel Hoogeboom, Alexey A. Gritsenko, Jasmijn Bastings, Ben Poole, Rianne van den Berg, Tim Salimans

We introduce Autoregressive Diffusion Models (ARDMs), a model class encompassing and generalizing order-agnostic autoregressive models (Uria et al., 2014) and absorbing discrete diffusion (Austin et al., 2021), which we show are special cases of ARDMs under mild assumptions.

Ranked #8 on Image Generation on CIFAR-10 (bits/dimension metric)

Image Generation

32,847

Paper
Code

VUT: Versatile UI Transformer for Multimodal Multi-Task User Interface Modeling

no code implementations • 29 Sep 2021 • Yang Li, Gang Li, Xin Zhou, Mostafa Dehghani, Alexey A. Gritsenko

Our model consists of a multimodal Transformer encoder that jointly encodes UI images and structures, and performs UI object detection when the UI structures are absent in the input.

object-detection Object Detection +2

Paper
Add Code

The Benchmark Lottery

no code implementations • 14 Jul 2021 • Mostafa Dehghani, Yi Tay, Alexey A. Gritsenko, Zhe Zhao, Neil Houlsby, Fernando Diaz, Donald Metzler, Oriol Vinyals

The world of empirical machine learning (ML) strongly relies on benchmarks in order to determine the relative effectiveness of different algorithms and methods.

Benchmarking BIG-bench Machine Learning +3

Paper
Add Code

A Spectral Energy Distance for Parallel Speech Synthesis

2 code implementations • NeurIPS 2020 • Alexey A. Gritsenko, Tim Salimans, Rianne van den Berg, Jasper Snoek, Nal Kalchbrenner

Speech synthesis is an important practical generative modeling problem that has seen great progress over the last few years, with likelihood-based autoregressive neural models now outperforming traditional concatenative systems.

Speech Synthesis

32,837

Paper
Code

IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression

no code implementations • ICLR 2021 • Rianne van den Berg, Alexey A. Gritsenko, Mostafa Dehghani, Casper Kaae Sønderby, Tim Salimans

Furthermore, we zoom in on the effect of gradient bias due to the straight-through estimator in integer discrete flows, and demonstrate that its influence is highly dependent on architecture choices and less prominent than previously thought.

Quantization

Paper
Add Code

On the relationship between Normalising Flows and Variational- and Denoising Autoencoders

no code implementations • ICLR Workshop DeepGenStruct 2019 • Alexey A. Gritsenko, Jasper Snoek, Tim Salimans

Normalising Flows (NFs) are a class of likelihood-based generative models that have recently gained popularity.

Denoising Normalising Flows +1

Paper
Add Code

BriarPatches: Pixel-Space Interventions for Inducing Demographic Parity

no code implementations • 17 Dec 2018 • Alexey A. Gritsenko, Alex D'Amour, James Atwood, Yoni Halpern, D. Sculley

We introduce the BriarPatch, a pixel-space intervention that obscures sensitive attributes from representations encoded in pre-trained classifiers.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.