Enterprise Benchmarks for Large Language Model EvaluationBing ZhangMikio Takeuchiet al.2025NAACL 2025
Are Large Language Models Effective in Clinical Trial Design? A Study on Baseline Feature GenerationNafis NeehalBowen Wanget al.2025NAACL 2025
OpenBioNER: Lightweight Open-Domain Biomedical Named Entity Recognition Through Entity Type DescriptionAlessio CocchieriGiacomo Frisoniet al.2025NAACL 2025
The Literary Canons of Large-Language Models: An Exploration of the Frequency of Novel and Author Generations Across Gender, Race and Ethnicity, and NationalityPaulina Toro IsazaNalani Kopp2025NAACL 2025
ASTER: Natural and Multi-language Unit Test Generation with LLMsRangeet PanMyeongsoo Kimet al.2025ICSE 2025
Beyond Omakase: Designing Shared Control for Navigation Robots with Blind PeopleRie KamikuboSeita Kayukawaet al.2025CHI 2025
Emerging Data Practices: Data Work in the Era of Large Language ModelsAdriana Alvarado GarciaHeloisa Caroline de Souza Pereira Candelloet al.2025CHI 2025
Responsible Prompting Recommendation: Fostering Responsible AI Practices in Prompting-TimeVagner Figueredo De SantanaSara Bergeret al.2025CHI 2025
Field Trials of Autonomous Navigation Robot for Visually Impaired PeopleHironobu TakagiKakuya Naitoet al.2025CHI 2025
“I Really Need Your Help with This Work...”: A System for Navigating the Tricky Terrain of Managing Up by Leveraging One’s Motivation to Get Things DoneSoya ParkStuti Vishwabhanet al.2025CHI 2025