VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models via Voxel Representation Paper • 2503.21214 • Published Mar 27 • 2
3D-LLM: Injecting the 3D World into Large Language Models Paper • 2307.12981 • Published Jul 24, 2023 • 37