How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?Hongkang LiMeng Wenget al.2024ICML 2024
FADAS: Towards Federated Adaptive Asynchronous OptimizationYujia WangShiqiang Wanget al.2024ICML 2024
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement LearningShuai ZhangHeshan Fernandoet al.2024ICML 2024
PILOT: An O (1/K)-Convergent Approach for Policy Evaluation with Nonlinear Function ApproximationZhuqing LiuXin Zhanget al.2024ICLR 2024
SLM: A Smoothed First-Order Lagrangian Method for Structured Constrained Nonconvex OptimizationSongtao Lu2023NeurIPS 2023
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with epsilon-Greedy ExplorationShuai ZhangHongkang Liet al.2023NeurIPS 2023
An Alternating Optimization Method for Bilevel Problems under the Polyak-Łojasiewicz ConditionQuan XiaoSongtao Luet al.2023NeurIPS 2023