VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models via Voxel Representation

Mar 27, 2025

View paper on arXiv