Taming Uncertainty via Automation: Observing, Analyzing, and Optimizing Agentic AI SystemsDany MoshkovichSergey Zeltyn2025ASE 2025
DocReRank: Single-Page Hard Negative Query Generation for Training Multi-Modal RAG RerankersNavve WassermanOliver Heinimannet al.2025EMNLP 2025
Classifier-Augmented Generation for Structured Workflow PredictionThomas GschwindShramona Chakrabortyet al.2025EMNLP 2025
Mind the Query: A Benchmark Dataset towards Text2Cypher TaskVashu ChauhanShobhit Rajet al.2025EMNLP 2025
Towards Enforcing Company Policy Adherence in Agentic WorkflowsNaama ZwerdlingDavid Boazet al.2025EMNLP 2025
Divide, Link, and Conquer: Recall-oriented Schema Linking for NL-to-SQL via Question DecompositionKiran PradeepKirushikesh D Bet al.2025EMNLP 2025
Group, Embed and Reason: A Hybrid LLM and Embedding Framework for Semantic Attribute AlignmenShramona ChakrabortyShashank Mujumdaret al.2025EMNLP 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsRadu MarinescuDebarun Bhattacharjyaet al.2025EMNLP 2025