Robust Deep Reinforcement Learning through Adversarial LossTuomas OikarinenWang Zhanget al.2021NeurIPS 2021
Fast Convergence for Unstable Reinforcement Learning Problems by Logarithmic MappingWang ZhangLam Nguyenet al.2022ICML 2022