[arXiv]score: 0.41
3D Primitives are a Spatial Language for VLMs
May 14, 2026
3D Primitives are a Spatial Language for VLMs shows that vision-language models can leverage 3D geometric primitives as intermediate representations to improve spatial reasoning, introducing SpatialBabel framework for this capability.
cs.CVcs.AIcs.DB