LegalWorld Simulates Full Chinese Civil Litigation Lifecycle for Agent Evaluation
June 18, 2026
LegalWorld models Chinese civil litigation as a five-stage causally connected state chain grounded in 75,309 paired judgments, with local and global memory infrastructure to maintain cross-stage consistency. LongJud-Bench, built on top, was validated by 18,992 ratings from 217 legal-background evaluators.
HOW THIS AFFECTS YOU
●
researcherLegalWorld is the first benchmark to model causal cross-stage dependencies in litigation, making it useful for evaluating long-horizon legal reasoning agents beyond isolated subtask performance.