Conference paper: Distributionally Robust Optimization for Input Model Uncertainty in Simulation-Based Decision Making
Conference paper: Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward