Search Results for author: Daniil Merkulov

Found 8 papers, 4 papers with code

NAG-GS: Semi-Implicit, Accelerated and Robust Stochastic Optimizer

2 code implementations • 29 Sep 2022 • Valentin Leplat, Daniil Merkulov, Aleksandr Katrutsa, Daniel Bershatsky, Olga Tsymboi, Ivan Oseledets

Classical machine learning models, such as deep neural networks, are usually trained using Stochastic Gradient Descent (SGD)-based algorithms.

Memory-Efficient Backpropagation through Large Linear Layers

2 code implementations • 31 Jan 2022 • Daniel Bershatsky, Aleksandr Mikhalev, Alexandr Katrutsa, Julia Gusak, Daniil Merkulov, Ivan Oseledets

We also investigate the variance of the gradient estimate induced by the randomized matrix multiplication.

Model Compression
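The snippet above mentions the variance of a gradient estimate built from randomized matrix multiplication. As a hedged illustration (a generic column-sampling estimator, not necessarily the paper's exact scheme), one can estimate A @ B by sampling inner-dimension indices uniformly; the estimator is unbiased, and averaging independent draws shrinks its variance:

```python
import numpy as np

def sampled_matmul(A, B, k, rng):
    """Unbiased estimate of A @ B from k sampled inner-dimension indices.

    Illustrative sketch only: indices are drawn uniformly with replacement,
    and the n/k factor makes the expectation equal the exact product.
    """
    n = A.shape[1]
    idx = rng.integers(0, n, size=k)
    return (n / k) * A[:, idx] @ B[idx, :]

rng = np.random.default_rng(0)
A = rng.standard_normal((8, 64))
B = rng.standard_normal((64, 8))
exact = A @ B

# Averaging many independent estimates reduces the variance of the estimator,
# so the mean approaches the exact product.
est = np.mean([sampled_matmul(A, B, 16, rng) for _ in range(2000)], axis=0)
rel_err = np.linalg.norm(est - exact) / np.linalg.norm(exact)
print(rel_err)  # small relative error after averaging
```

A single draw with k much smaller than n is cheap but noisy; the trade-off between k (memory/compute) and the variance of the resulting gradient estimate is the quantity such an analysis studies.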

Fast Line Search for Multi-Task Learning

no code implementations • 2 Oct 2021 • Andrey Filatov, Daniil Merkulov

However, line search for the step size is usually not the best choice because of its large computational overhead.

Multi-Task Learning

Stochastic gradient algorithms from ODE splitting perspective

no code implementations ICLR Workshop DeepDiffEq 2019 Daniil Merkulov, Ivan Oseledets

We present a different view on stochastic optimization, which goes back to the splitting schemes for approximate solutions of ODE.

regression Stochastic Optimization

Empirical study of extreme overfitting points of neural networks

no code implementations14 Jun 2019 Daniil Merkulov, Ivan Oseledets

In this paper we propose a method of obtaining points of extreme overfitting - parameters of modern neural networks, at which they demonstrate close to 100 % training accuracy, simultaneously with almost zero accuracy on the test sample.

Cannot find the paper you are looking for? You can Submit a new open access paper.