FETA: Towards Specializing Foundational Models for Expert Task Applications. Amit Alfassy, Assaf Arbelle, et al. 2022. NeurIPS 2022.
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs. Irene Huang, Wei Lin, et al. 2024. NeurIPS 2024.
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models. Sivan Doveh, Assaf Arbelle, et al. 2023. NeurIPS 2023.
Bringing Image Structure to Video via Frame-Clip Consistency of Object Tokens. Elad Ben-Avraham, Roei Herzig, et al. 2022. NeurIPS 2022.
Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens. Elad Ben Avraham, Roi Herzig, et al. 2022. NeurIPS 2022.
Teaching Structured Vision & Language Concepts to Vision & Language Models. Sivan Doveh, Assaf Arbelle, et al. 2023. CVPR 2023.
Unsupervised Domain Generalization by Learning a Bridge Across Domains. Sivan Harary, Eli Schwartz, et al. 2022. CVPR 2022.