[arXiv] score: 0.24

Psychologically Potent, Computationally Invisible: LLMs Generate Social-Comparison Triggers They Fail to Detect

May 5, 2026
XHS-SCoRE, a new benchmark released on arXiv, targets social-comparison detection in Chinese Xiaohongshu posts. Each post is classified by the comparison it elicits in readers (upward, downward, or neutral), a signal distinct from sentiment. Prompted LLMs show systematic failure modes, including comparison neutralization and directional skew, while supervised Chinese encoders perform better in-domain. Critically, LLM-generated posts demonstrably shift readers' perceived social standing, even though the generating model cannot reliably detect that same signal. Researchers building content moderation systems, recommendation algorithms, or mental-health-aware NLP pipelines for Chinese social platforms should treat this as a concrete capability gap with measurable behavioral consequences.
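
To make the two named failure modes concrete, here is a minimal sketch of how they could be quantified from gold and predicted labels. The summary does not give the paper's definitions, so both metrics are assumptions made for illustration: "neutralization rate" is taken to be the share of gold upward/downward posts predicted as neutral, and "directional skew" the difference between the upward share of directional predictions and the upward share of directional gold labels. The function name `failure_mode_rates` and the label strings are hypothetical.

```python
from collections import Counter

# Label names are assumed for illustration; the benchmark's actual label strings may differ.
DIRECTIONAL = ("upward", "downward")


def _upward_share(counts):
    """Fraction of directional labels that are 'upward' (0.0 if there are none)."""
    total = counts["upward"] + counts["downward"]
    return counts["upward"] / total if total else 0.0


def failure_mode_rates(gold, pred):
    """Compute two illustrative (assumed) failure-mode metrics.

    neutralization_rate: share of gold directional posts predicted 'neutral'.
    directional_skew:    upward share among directional predictions minus
                         upward share among directional gold labels.
                         Positive means the model over-predicts 'upward'.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    # Count gold directional posts and how many of them were predicted neutral.
    directional_gold = sum(1 for g in gold if g in DIRECTIONAL)
    neutralized = sum(
        1 for g, p in zip(gold, pred) if g in DIRECTIONAL and p == "neutral"
    )

    gold_counts = Counter(g for g in gold if g in DIRECTIONAL)
    pred_counts = Counter(p for p in pred if p in DIRECTIONAL)

    return {
        "neutralization_rate": (
            neutralized / directional_gold if directional_gold else 0.0
        ),
        "directional_skew": _upward_share(pred_counts) - _upward_share(gold_counts),
    }


if __name__ == "__main__":
    # Toy labels, not benchmark data.
    gold = ["upward", "upward", "downward", "neutral"]
    pred = ["neutral", "upward", "upward", "neutral"]
    print(failure_mode_rates(gold, pred))
```

On real data, these two numbers would separate a model that collapses comparisons into neutral from one that detects a comparison but misjudges its direction, which is why they are reported separately here rather than folded into a single accuracy figure.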
cs.CL