Search Results for author: Alexander Peysakhovich

Found 20 papers, 3 papers with code

“Other-Play” for Zero-Shot Coordination

no code implementations ICML 2020 Hengyuan Hu, Alexander Peysakhovich, Adam Lerer, Jakob Foerster

We consider the problem of zero-shot coordination - constructing AI agents that can coordinate with novel partners they have not seen before (e. g. humans).

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Attention Sorting Combats Recency Bias In Long Context Language Models

no code implementations28 Sep 2023 Alexander Peysakhovich, Adam Lerer

Current language models often fail to incorporate long contexts efficiently during generation.

Position Retrieval

Diagnosis Uncertain Models For Medical Risk Prediction

no code implementations29 Jun 2023 Alexander Peysakhovich, Rich Caruana, Yin Aphinyanaphongs

We consider a patient risk models which has access to patient features such as vital signs, lab values, and prior history but does not have access to a patient's diagnosis.

Pseudo-Euclidean Attract-Repel Embeddings for Undirected Graphs

no code implementations17 Jun 2021 Alexander Peysakhovich, Anna Klimovskaia Susmel, Leon Bottou

Dot product embeddings take a graph and construct vectors for nodes such that dot products between two vectors give the strength of the edge.

Link Prediction Representation Learning

Evaluating and Rewarding Teamwork Using Cooperative Game Abstractions

no code implementations NeurIPS 2020 Tom Yan, Christian Kroer, Alexander Peysakhovich

We apply our methods to study teams of artificial RL agents as well as real world teams from professional sports.

Fair Division Without Disparate Impact

no code implementations6 Jun 2019 Alexander Peysakhovich, Christian Kroer

We consider the problem of dividing items between individuals in a way that is fair both in the sense of distributional fairness and in the sense of not having disparate impact across protected classes.

Fairness Recommendation Systems

Backplay: 'Man muss immer umkehren'

no code implementations ICLR 2019 Cinjon Resnick, Roberta Raileanu, Sanyam Kapoor, Alexander Peysakhovich, Kyunghyun Cho, Joan Bruna

Our contributions are that we analytically characterize the types of environments where Backplay can improve training speed, demonstrate the effectiveness of Backplay both in large grid worlds and a complex four player zero-sum game (Pommerman), and show that Backplay compares favorably to other competitive methods known to improve sample efficiency.

Reinforcement Learning (RL)

Robust Multi-agent Counterfactual Prediction

no code implementations NeurIPS 2019 Alexander Peysakhovich, Christian Kroer, Adam Lerer

We consider the problem of using logged data to make predictions about what would happen if we changed the `rules of the game' in a multi-agent system.

counterfactual

Discovering Context Effects from Raw Choice Data

no code implementations8 Feb 2019 Arjun Seshadri, Alexander Peysakhovich, Johan Ugander

An important class of such contexts is the composition of the choice set.

Computing large market equilibria using abstractions

no code implementations18 Jan 2019 Christian Kroer, Alexander Peysakhovich, Eric Sodomka, Nicolas E. Stier-Moses

Computing market equilibria is an important practical problem for market design, for example in fair division of items.

Matrix Completion

Reinforcement Learning and Inverse Reinforcement Learning with System 1 and System 2

no code implementations19 Nov 2018 Alexander Peysakhovich

Under the rational actor assumption techniques such as inverse reinforcement learning (IRL) can be used to infer a person's goals from their actions.

Recommendation Systems reinforcement-learning +1

Improving pairwise comparison models using Empirical Bayes shrinkage

no code implementations24 Jul 2018 Stephen Ragain, Alexander Peysakhovich, Johan Ugander

As such, different models of the comparison process lead to different shrinkage estimators.

Backplay: "Man muss immer umkehren"

1 code implementation18 Jul 2018 Cinjon Resnick, Roberta Raileanu, Sanyam Kapoor, Alexander Peysakhovich, Kyunghyun Cho, Joan Bruna

Our contributions are that we analytically characterize the types of environments where Backplay can improve training speed, demonstrate the effectiveness of Backplay both in large grid worlds and a complex four player zero-sum game (Pommerman), and show that Backplay compares favorably to other competitive methods known to improve sample efficiency.

Reinforcement Learning (RL)

Learning Existing Social Conventions via Observationally Augmented Self-Play

no code implementations26 Jun 2018 Adam Lerer, Alexander Peysakhovich

When there are multiple possible conventions we show that learning a policy via multi-agent reinforcement learning (MARL) is likely to find policies which achieve high payoffs at training time but fail to coordinate with the real group into which the agent enters.

Imitation Learning Multi-agent Reinforcement Learning +1

Consequentialist conditional cooperation in social dilemmas with imperfect information

no code implementations ICLR 2018 Alexander Peysakhovich, Adam Lerer

We show how to construct such strategies using deep reinforcement learning techniques and demonstrate, both analytically and experimentally, that they are effective in social dilemmas beyond simple matrix games.

Prosocial learning agents solve generalized Stag Hunts better than selfish ones

no code implementations8 Sep 2017 Alexander Peysakhovich, Adam Lerer

We extend existing work on reward-shaping in multi-agent reinforcement learning and show that that making a single agent prosocial, that is, making them care about the rewards of their partners can increase the probability that groups converge to good outcomes.

Multi-agent Reinforcement Learning reinforcement-learning +1

Learning causal effects from many randomized experiments using regularized instrumental variables

no code implementations4 Jan 2017 Alexander Peysakhovich, Dean Eckles

Scientific and business practices are increasingly resulting in large collections of randomized experiments.

Multi-Agent Cooperation and the Emergence of (Natural) Language

1 code implementation21 Dec 2016 Angeliki Lazaridou, Alexander Peysakhovich, Marco Baroni

The sender is told one of them is the target and is allowed to send a message from a fixed, arbitrary vocabulary to the receiver.

Cannot find the paper you are looking for? You can Submit a new open access paper.