[arXiv]score: 0.07
107-Dialogue Chess Dataset Tests Collaborative Reasoning and Deliberation
June 4, 2026
DeliChess provides 107 multi-party group dialogues where participants solve chess puzzles individually then collectively, with pre/post-discussion answers and chess engine evaluation metrics. Deliberation measurably improves group accuracy, and probing utterances are analyzed via a trained classifier.
cs.CLcs.AIcs.HC
HOW THIS AFFECTS YOU
●
researcherUseful as a structured benchmark for studying collaborative reasoning dynamics in multi-party dialogue models.