Training-Control-as-Code: Towards a declarative solution to control trainingPadmanabha Venkatagiri SeshadriHarikrishnan Balagopalet al.2025ASE 2025
Declarative Techniques for NL Queries over Heterogeneous DataElham KhabiriJeff Kephartet al.2025EMNLP 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsRadu MarinescuDebarun Bhattacharjyaet al.2025EMNLP 2025
Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health MonitoringShuxin LinDhaval Patelet al.2025EMNLP 2025
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language ModelsDebarun BhattacharjyaBalaji Ganesanet al.2025EMNLP 2025
Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssistElizabeth DalyErik Miehlinget al.2025EMNLP 2025
KIF-QA: Using Off-the-shelf LLMs to Answer Simple Questions over Heterogeneous Knowledge BasesMarcelo MachadoJoão Pedro Porto Camposet al.2025ISWC 2025
Highlight All the Phrases: Enhancing LLM Transparency through Visual Factuality IndicatorsHyo Jin DoRachel Ostrandet al.2025AIES 2025
TerraMind: Large-Scale Generative Multimodality for Earth ObservationJohannes JakubikFelix Yanget al.2025ICCV 2025