Enterprise Benchmarks for Large Language Model EvaluationBing ZhangMikio Takeuchiet al.2025NAACL 2025
Are Large Language Models Effective in Clinical Trial Design? A Study on Baseline Feature GenerationNafis NeehalBowen Wanget al.2025NAACL 2025
OpenBioNER: Lightweight Open-Domain Biomedical Named Entity Recognition Through Entity Type DescriptionAlessio CocchieriGiacomo Frisoniet al.2025NAACL 2025
The Literary Canons of Large-Language Models: An Exploration of the Frequency of Novel and Author Generations Across Gender, Race and Ethnicity, and NationalityPaulina Toro IsazaNalani Kopp2025NAACL 2025
ASTER: Natural and Multi-language Unit Test Generation with LLMsRangeet PanMyeongsoo Kimet al.2025ICSE 2025
Beyond Omakase: Designing Shared Control for Navigation Robots with Blind PeopleRie KamikuboSeita Kayukawaet al.2025CHI 2025
Responsible Prompting Recommendation: Fostering Responsible AI Practices in Prompting-TimeVagner Figueredo De SantanaSara Bergeret al.2025CHI 2025
Emerging Data Practices: Data Work in the Era of Large Language ModelsAdriana Alvarado GarciaHeloisa Caroline de Souza Pereira Candelloet al.2025CHI 2025
Field Trials of Autonomous Navigation Robot for Visually Impaired PeopleHironobu TakagiKakuya Naitoet al.2025CHI 2025
“I Really Need Your Help with This Work...”: A System for Navigating the Tricky Terrain of Managing Up by Leveraging One’s Motivation to Get Things DoneSoya ParkStuti Vishwabhanet al.2025CHI 2025