Search Results for author: Dmitry Ivanov

Found 10 papers, 6 papers with code

Personalized Reinforcement Learning with a Budget of Policies

1 code implementation • 12 Jan 2024 • Dmitry Ivanov, Omer Ben-Porat

In an r-MDP, we cater to a diverse user population, each with unique preferences, through interaction with a small set of representative policies.

Autonomous Driving Recommendation Systems +1

Paper
Code

Mediated Multi-Agent Reinforcement Learning

1 code implementation • 14 Jun 2023 • Dmitry Ivanov, Ilya Zisman, Kirill Chernyshev

The majority of Multi-Agent Reinforcement Learning (MARL) literature equates the cooperation of self-interested agents in mixed environments to the problem of social welfare maximization, allowing agents to arbitrarily share rewards and private information.

Multi-agent Reinforcement Learning reinforcement-learning

Paper
Code

Neuromorphic Artificial Intelligence Systems

no code implementations • 25 May 2022 • Dmitry Ivanov, Aleksandr Chezhegov, Andrey Grunin, Mikhail Kiselev, Denis Larionov

Modern AI systems, based on von Neumann architecture and classical neural networks, have a number of fundamental limitations in comparison with the brain.

Paper
Add Code

Self-Imitation Learning from Demonstrations

no code implementations • 21 Mar 2022 • Georgiy Pshikhachev, Dmitry Ivanov, Vladimir Egorov, Aleksei Shpilman

Modern LfD algorithms require meticulous tuning of hyperparameters that control the influence of demonstrations and, as we show in the paper, struggle with learning from suboptimal demonstrations.

Imitation Learning Reinforcement Learning (RL)

Paper
Add Code

Improving State-of-the-Art in One-Class Classification by Leveraging Unlabeled Data

1 code implementation • 14 Mar 2022 • Farid Bagirov, Dmitry Ivanov, Aleksei Shpilman

The former only learns from labeled positive data, whereas the latter also utilizes unlabeled data to improve the overall performance.

Binary Classification One-Class Classification

Paper
Code

Optimal-er Auctions through Attention

1 code implementation • 26 Feb 2022 • Dmitry Ivanov, Iskander Safiulin, Igor Filippov, Ksenia Balabaeva

The second is a loss function that requires explicit specification of an acceptable IC violation denoted as regret budget.

Paper
Code

Neural Network Optimization for Reinforcement Learning Tasks Using Sparse Computations

no code implementations • 7 Jan 2022 • Dmitry Ivanov, Mikhail Kiselev, Denis Larionov

This article proposes a sparse computation-based method for optimizing neural networks for reinforcement learning (RL) tasks.

Network Pruning reinforcement-learning +1

Paper
Add Code

Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World

no code implementations • 30 Mar 2021 • Florian Laurent, Manuel Schneider, Christian Scheller, Jeremy Watson, Jiaoyang Li, Zhe Chen, Yi Zheng, Shao-Hung Chan, Konstantin Makhnev, Oleg Svidchenko, Vladimir Egorov, Dmitry Ivanov, Aleksei Shpilman, Evgenija Spirovska, Oliver Tanevski, Aleksandar Nikov, Ramon Grunder, David Galevski, Jakov Mitrovski, Guillaume Sartoretti, Zhiyao Luo, Mehul Damani, Nilabha Bhattacharya, Shivam Agarwal, Adrian Egli, Erik Nygren, Sharada Mohanty

However, the coordination of hundreds of agents in a real-life setting like a railway network remains challenging and the Flatland environment used for the competition models these real-world properties in a simplified manner.

Multi-agent Reinforcement Learning Scheduling

Paper
Add Code

Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments

1 code implementation • 24 Feb 2021 • Dmitry Ivanov, Vladimir Egorov, Aleksei Shpilman

Recent reinforcement learning studies extensively explore the interplay between cooperative and competitive behaviour in mixed environments.

Multi-agent Reinforcement Learning Q-Learning

Paper
Code

DEDPUL: Difference-of-Estimated-Densities-based Positive-Unlabeled Learning

1 code implementation • 19 Feb 2019 • Dmitry Ivanov

The objectives are to classify the unlabeled sample and train an unbiased PN classifier, which generally requires to identify the mixing proportions of positives and negatives first.

Binary Classification Density Estimation +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.