ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks. Saurabh Jha, Rohan Arora, et al. ICML 2025.
Representing Prompting Patterns with PDL: Compliance Agent Case Study. Mandana Vaziri, Louis Mandel, et al. ICML 2025.