A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning. Dong Ki Kim, Miao Liu, et al. ICML 2021.
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines. Keerthiram Murugesan, Mattia Atzeni, et al. AAAI 2021.
Decentralized TD Tracking with Linear Function Approximation and its Finite-Time Analysis. Gang Wang, Songtao Lu, et al. NeurIPS 2020.
Efficient Black-box Planning using Macro Actions with Focused Effects. Cameron Allen, Michael Katz, et al. IJCAI 2021.