SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge. Andong Wang, Bo Wu, et al. CVPR 2024.
Resource-Efficient Transformer Pruning for Finetuning of Large Models. Fatih Ilhan, Gong Su, et al. CVPR 2024.
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World. Yining Hong, Zishuo Zheng, et al. CVPR 2024.