●researcherProvides a structured multi-turn evaluation framework for psychiatric LLMs with clinician-verified labels, useful for benchmarking dialogue agents in clinical settings.
●healthWorth watching as a rare large-scale Chinese-language psychiatric benchmark with realistic clinical distributions, relevant for teams building mental health AI tools.