Navigating the Modern Evaluation Landscape: Considerations in Benchmarks and Frameworks for Large Language Models (LLMs)Leshem ChoshenAriel Geraet al.2024LREC-COLING 2024
ACHIEVING HUMAN PARITY IN CONTENT-GROUNDED DATASETS GENERATIONAsaf YehudaiBoaz Carmeliet al.2024ICLR 2024
Asymmetry in Low-Rank Adapters of Foundation ModelsJiacheng ZhuKristjan Greenewaldet al.2024ICLR 2024
Where to start? Analyzing the potential value of intermediate modelsLeshem ChoshenElad Venezianet al.2023EMNLP 2023
Knowledge is a Region in Weight Space for Finetuned Language ModelsAlmog GuetaElad Venezianet al.2023EMNLP 2023
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question AnsweringElla NeemanRoee Aharoniet al.2023ACL 2023
ColD Fusion: Collaborative Descent for Distributed Multitask FinetuningShachar Don-YehiyaElad Venezianet al.2023ACL 2023
Label Sleuth: From Unlabeled Text to a Classifier in a Few HoursEyal ShnarchAlon Halfonet al.2022EMNLP 2022