Search Results for author: Andrey Kuznetsov

Found 13 papers, 10 papers with code

Pixel-Level BPE for Auto-Regressive Image Generation

no code implementations MMMPIE (COLING) 2022 Anton Razzhigaev, Anton Voronov, Andrey Kaznacheev, Andrey Kuznetsov, Denis Dimitrov, Alexander Panchenko

Pixel-level autoregression with Transformer models (Image GPT or iGPT) is one of the recent approaches to image generation that has not received massive attention and elaboration due to quadratic complexity of attention as it imposes huge memory requirements and thus restricts the resolution of the generated images.

Image Generation

Kandinsky 3.0 Technical Report

1 code implementation6 Dec 2023 Vladimir Arkhipkin, Andrei Filatov, Viacheslav Vasilev, Anastasia Maltseva, Said Azizov, Igor Pavlov, Julia Agafonova, Andrey Kuznetsov, Denis Dimitrov

We focus on the key components that, as we have identified as a result of a large number of experiments, had the most significant impact on improving the quality of our model compared to the others.

Text-to-Image Generation

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

1 code implementation22 Nov 2023 Vladimir Arkhipkin, Zein Shaheen, Viacheslav Vasilev, Elizaveta Dakhova, Andrey Kuznetsov, Denis Dimitrov

The first stage concerns keyframes synthesis to figure the storyline of a video, while the second one is devoted to interpolation frames generation to make movements of the scene and objects smooth.

SSIM Text-to-Video Generation +1

The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models

no code implementations10 Nov 2023 Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov

In this study, we present an investigation into the anisotropy dynamics and intrinsic dimension of embeddings in transformer architectures, focusing on the dichotomy between encoders and decoders.

RusTitW: Russian Language Text Dataset for Visual Text in-the-Wild Recognition

1 code implementation29 Mar 2023 Igor Markov, Sergey Nesteruk, Andrey Kuznetsov, Denis Dimitrov

In this paper, we present a large-scale human-labeled dataset for Russian text recognition in-the-wild.

Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture

1 code implementation22 Nov 2021 Daria Bakshandaeva, Denis Dimitrov, Vladimir Arkhipkin, Alex Shonenkov, Mark Potanin, Denis Karachev, Andrey Kuznetsov, Anton Voronov, Vera Davydova, Elena Tutubalina, Aleksandr Petiushko

Supporting the current trend in the AI community, we present the AI Journey 2021 Challenge called Fusion Brain, the first competition which is targeted to make the universal architecture which could process different modalities (in this case, images, texts, and code) and solve multiple tasks for vision and language.

Handwritten Text Recognition object-detection +4

Feathers dataset for Fine-Grained Visual Categorization

2 code implementations18 Apr 2020 Alina Belko, Konstantin Dobratulin, Andrey Kuznetsov

This paper introduces a novel dataset FeatherV1, containing 28, 272 images of feathers categorized by 595 bird species.

Fine-Grained Visual Categorization Fine-Grained Visual Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.