[arXiv]score: 0.38
MM-CreativityBench Tests LMMs on Affordance-Grounded Physical Problem-Solving
May 27, 2026
MM-CreativityBench evaluates whether large multimodal models can identify non-obvious, physically feasible repurposings of scene elements — a creative affordance task current benchmarks don't cover.
cs.AIcs.CLcs.LG
HOW THIS AFFECTS YOU
●
researcherNew benchmark for probing whether LMM perception/reasoning generalizes to open-ended, affordance-grounded creative tool use beyond standard pattern recognition tasks.