COVLM: COMPOSING VISUAL ENTITIES AND RELATIONSHIPS IN LARGE LANGUAGE MODELS VIA COMMUNICATIVE DECODINGJunyan LiDelin Chenet al.2024ICLR 2024
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D WorldYining HongZishuo Zhenget al.2024CVPR 2024