Scalable Evaluation and Neural Models for Compositional Generalization. Giacomo Camposampiero, Pietro Barbiero, et al. 2025. NeurIPS 2025.
Flick: Empowering Federated Learning with Commonsense Knowledge. Ran Zhu, Mingkun Yang, et al. 2025. NeurIPS 2025.
Lessons Learned: A Multi-Agent Framework for Code LLMs to Learn and Improve. Yuanzhe Liu, Ryan Deng, et al. 2025. NeurIPS 2025.
Toward a Coherent Virtual Cell Model: Probing Biological World-Model Coherence in Transcriptomic Foundation Models. Noa Moriel, Yishai Shimoni, et al. 2025. NeurIPS 2025.
Verifiable Chemical Reasoning through Tool-Calling Agentic Workflow. Gabrielle Gaudeau, Shinnosuke Tanaka, et al. 2025. NeurIPS 2025.
FlowState: Sampling-Rate Invariant Time Series Foundation Model with Dynamic Forecasting Horizons. Lars Graf, Thomas Bohnstingl, et al. 2025. NeurIPS 2025.
Licence to Scale: A Microservice Simulation Environment for Benchmarking Agentic AI. Christopher Lohse, Adrian Selk, et al. 2025. NeurIPS 2025.
Foundation Models Enabling Multi-Scale Battery Materials Discovery: From Molecules To Devices. Vidushi Sharma, Andy Tek, et al. 2025. NeurIPS 2025.
The Shepherd Test: How Will Superintelligent Agents Balance Care and Control in Asymmetric Relationships? Djallel Bouneffouf, Matthew Riemer, et al. 2025. NeurIPS 2025.