[arXiv]score: 0.13
LLMs Excel at Symbolic Composition But Struggle With Real-World Reference
June 1, 2026
LLMs outperform humans on intensional (symbolic formula) tasks but underperform on extensional (real-world referent) tasks in compositional noun phrase interpretation. Tested on the Personal Relation Task, where models must resolve phrases like "Amber's parent's friend," the results suggest LLMs learn compositional structure without grounding it to world entities as naturally as humans do.
cs.CL
HOW THIS AFFECTS YOU
●
researcherWorth watching because it isolates a specific compositional grounding gap in LLMs, offering a clean benchmark split between symbolic and referential reasoning that could inform evaluation design.