Search Results for author: Dmitry Ivanov

Found 10 papers, 6 papers with code

Personalized Reinforcement Learning with a Budget of Policies

1 code implementation12 Jan 2024 Dmitry Ivanov, Omer Ben-Porat

In an r-MDP, we cater to a diverse user population, each with unique preferences, through interaction with a small set of representative policies.

Autonomous Driving Recommendation Systems +1

Mediated Multi-Agent Reinforcement Learning

1 code implementation14 Jun 2023 Dmitry Ivanov, Ilya Zisman, Kirill Chernyshev

The majority of Multi-Agent Reinforcement Learning (MARL) literature equates the cooperation of self-interested agents in mixed environments to the problem of social welfare maximization, allowing agents to arbitrarily share rewards and private information.

Multi-agent Reinforcement Learning reinforcement-learning

Neuromorphic Artificial Intelligence Systems

no code implementations25 May 2022 Dmitry Ivanov, Aleksandr Chezhegov, Andrey Grunin, Mikhail Kiselev, Denis Larionov

Modern AI systems, based on von Neumann architecture and classical neural networks, have a number of fundamental limitations in comparison with the brain.

Self-Imitation Learning from Demonstrations

no code implementations21 Mar 2022 Georgiy Pshikhachev, Dmitry Ivanov, Vladimir Egorov, Aleksei Shpilman

Modern LfD algorithms require meticulous tuning of hyperparameters that control the influence of demonstrations and, as we show in the paper, struggle with learning from suboptimal demonstrations.

Imitation Learning Reinforcement Learning (RL)

Improving State-of-the-Art in One-Class Classification by Leveraging Unlabeled Data

1 code implementation14 Mar 2022 Farid Bagirov, Dmitry Ivanov, Aleksei Shpilman

The former only learns from labeled positive data, whereas the latter also utilizes unlabeled data to improve the overall performance.

Binary Classification One-Class Classification

Optimal-er Auctions through Attention

1 code implementation26 Feb 2022 Dmitry Ivanov, Iskander Safiulin, Igor Filippov, Ksenia Balabaeva

The second is a loss function that requires explicit specification of an acceptable IC violation denoted as regret budget.

Neural Network Optimization for Reinforcement Learning Tasks Using Sparse Computations

no code implementations7 Jan 2022 Dmitry Ivanov, Mikhail Kiselev, Denis Larionov

This article proposes a sparse computation-based method for optimizing neural networks for reinforcement learning (RL) tasks.

Network Pruning reinforcement-learning +1

Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments

1 code implementation24 Feb 2021 Dmitry Ivanov, Vladimir Egorov, Aleksei Shpilman

Recent reinforcement learning studies extensively explore the interplay between cooperative and competitive behaviour in mixed environments.

Multi-agent Reinforcement Learning Q-Learning

DEDPUL: Difference-of-Estimated-Densities-based Positive-Unlabeled Learning

1 code implementation19 Feb 2019 Dmitry Ivanov

The objectives are to classify the unlabeled sample and train an unbiased PN classifier, which generally requires to identify the mixing proportions of positives and negatives first.

Binary Classification Density Estimation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.