Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits. Aadirupa Saha, Shubham Gupta. ICML 2022.
One Arrow, Two Kills: A Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits. Pierre Gaillard, Aadirupa Saha, et al. AISTATS 2023.