[arXiv] score: 0.13

LLM Personality Self-Reports Fail to Predict Actual Behavior Across 25 Models

June 10, 2026

Even a purpose-built psychometric instrument does not make LLM self-reports predictive of behavior. The instrument was derived from LLM behavioral data via exploratory factor analysis and yields five factors, including Responsiveness, Deference, and Guardedness. Across 25 models from 17 families, self-reported scores on these factors still do not predict observed open-ended behavior, even though the constructs are grounded in LLM affordances rather than human trait theory.

HOW THIS AFFECTS YOU

● Researcher: Self-report-based LLM evaluation is unreliable even with natively derived constructs, undermining a common shortcut for behavioral characterization.

● Policy: Worth watching because personality-based alignment or safety assessments using self-report inventories lack predictive validity and should not be used as behavioral proxies.

read original ↗ arxiv.org
