Publication

Robust Algorithms for Multiagent Bandits with Heavy Tails

July 12, 2020

People

Groups

Share this publication

Abhimanyu Dubey and Alex Pentland. Robust algorithms for multiagent bandits with heavy tails. International Conference on Machine Learning, 2020.

Abstract

We study the heavy-tailed stochastic bandit problem in the cooperative multiagent setting, where a group of agents interact with a common bandit problem, while communicating on a network with delays. Existing algorithms for the stochastic bandit in this setting utilize confidence intervals arising from an averaging-based communication protocol known as running consensus, that does not lend itself to robust estimation for heavy-tailed settings. We propose MP-UCB, a decentralized multi-agent algorithm for the cooperative stochastic bandit that incorporates robust estimation with a message-passing protocol. We prove optimal regret bounds for MP-UCB for several problem settings, and also demonstrate its superiority to existing methods. Furthermore, we establish the first lower bounds for the cooperative bandit problem, in addition to providing efficient algorithms for robust bandit estimation of location

via author preprint

Robust Algorithms for Multiagent Bandits with Heavy Tails

People

Groups

Abstract

Kernel Methods for Cooperative Contextual Bandits

Private and Byzantine-Proof Cooperative Decision-Making

Differentially-Private Federated Linear Bandits

Abhimanyu Dubey chosen as a 2019 Snap Research Scholar

Robust Algorithms for Multiagent Bandits with Heavy Tails

People

Groups

Share this publication

Abstract

Kernel Methods for Cooperative Contextual Bandits

Private and Byzantine-Proof Cooperative Decision-Making

Differentially-Private Federated Linear Bandits

Abhimanyu Dubey chosen as a 2019 Snap Research Scholar